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1 Introduction 

This is the first of a five part work in which we study the Lagrangian and 
Hamiltonian structure of classical field theories with constraints. Our goal is 
to explore some of the connections between initial value constraints and gauge 
transformations in such theories (either relativistic or not). To do this, in the 
course of this book, we develop and use a number of tools from symplectic and 
multisymplectic geometry. Of central importance in our analysis is the notion of 
the "energy-momentum map" associated to the gauge group of a given classical 
field theory. We hope to demonstrate that, as illustrated and explained below, 
many different and apparently unrelated facets of field theories can be thereby 
tied together and understood in an essentially new way. 

Our research is motivated by previous exhaustive studies of several stan- 
dard examples following the methods of Choquet-Bruhat, Lichnerowicz, Dirac- 
Bergmann, and Arnowitt-Deser-Misner. In particular, if one performs an ini- 
tial value (or "canonical" ) analysis of Einstein's theory of gravity (either alone 
or coupled to Dirac, electromagnetic, or other fields), Yang-Mills theory, rel- 
ativistic fluid theories, bosonic string theory, topological field theories, and so 
forth, one finds that they share the following striking features. (See Fischer and 
Marsden [1979a,b], Isenberg and Nester [1980], Kiinzle and Nester [1984], Bao, 
Isenberg, and Yasskin [1985], Batlle, Gomis, and Pons [1986], and Horowitz 
[1989] and references therein for discussions of these and other field theories.) 

1. The Euler-Lagrange equations are underdetermined; that is, there are not 
enough evolution equations to propagate all the field components. This is because 
the theory has gauge freedom. The corresponding gauge group is known at the 
outset. 

As a consequence, some of the fields do not have their evolution determined 
by the Euler-Lagrange equations. These "kinematic fields" have no physical 
significance. Amongst the remaining fields are the "dynamic fields" tp (with 
conjugate momenta p); these are the fields that have physical meaning. 

The situation is further complicated in that the initial data (?/>(0), p(0)) 
cannot be freely specified. 

2. The Euler-Lagrange equations are overdetermined in that they include con- 
straints 

&y>,p) = o (i.i) 

on the choice of Cauchy data. 

Equations l|l. ljl typically comprise an elliptic system whose component equa- 
tions can be of variable order. For simplicity we suppose that all constraints 
are "first class" in the terminology of Dirac [1964]. (The case when second class 
constraints are present will be discussed as the need arises.) 

It is remarkable that the Euler-Lagrange equations are simultaneously un- 
derdetermined and overdetermined. The next feature ties these two properties 
together. 
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3. The constraint functions generate gauge transformations of the dynamic 
fields via the canonical symplectic structure on the space of Cauchy data. 

Thus, the presence of (first class) initial value constraints is intimately re- 
lated to the gauge freedom of the theory. Since gauge transformations are part 
of the dynamics and since the constraints generate gauge transformations, one 
expects the functions to appear in the Hamiltonian. Indeed, in these exam- 
ples, we find: 

4. The Hamiltonian (with respect to a slicing of spacetime, usually taken to 
consist of Cauchy surfaces) has the form 



depending linearly on the "atlas fields " on . Here £ is a hypersurface modeling 
a typical member of the given slicing. 

The atlas fields, discussed in more detail later, are closely related to the 
kinematic fields mentioned above in item 1. (Being sections of an associated 
bundle, they may be regarded as certain combinations of the kinematic fields 
and elements of the Lie algebra of the gauge group.) The atlas fields are arbi- 
trarily specifiable; intuitively, one can view them as governing the propagation 
of coordinates on a Cauchy surface (and on bundles of fields over a Cauchy 
surface) in time — hence the name. Their importance derives from the fact that 
they "drive" the entire gauge ambiguity of the theory (cf. the next item). 

Introduce an L 2 inner product on the space of Cauchy data (with correspond- 
ing adjoint denoted by *), and let JJ be an almost complex structure compatible 
with both the given inner product and the symplectic form. Let A be a pa- 
rameter ("time") labeling the slicing mentioned above. From (|1.2|) . it follows 
that 

5. The evolution equations for the dynamic fields (tp,p) take the "adjoint form" 



Equations i|1.3fl are typically a hyperbolic system; hence, for fixed ai(A), 
they uniquely determine the evolution of the dynamic fields and their conjugate 
momenta from given initial data. Here, the atlas fields a^(A) are to be specified 
ab initio; this amounts to a gauge-fixing. Equations (|1.3|) display, in the clearest 
and most concise way, the interrelations between the dynamics, the initial value 
constraints, and the gauge ambiguity of a theory. 

The constraint equations are preserved by the evolution equations 1|1.3[) . 
Of course, 

6. The covariant Euler-Lagrange equations are equivalent, modulo gauge trans- 
formations, to the combined evolution and constraint equations (|1.3[1 and (II. 111 . 




(1.2) 




(1.3) 
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This result, together with items 4 and 5, constitute the theoretical back- 
ground necessary to analyze the structure of the solution space of the Euler- 
Lagrange equations. For the theories mentioned above, we have: 

7. The space of solutions of the field equations is not necessarily a smooth mani- 
fold. It may have quadratic singularities occurring exactly at those solutions that 
are symmetric (in the sense of being invariant under the action of a nondiscrete 
subgroup of the gauge group) . 

See Fischer, Marsden, and Moncrief [1980], Arms, Marsden, and Moncrief 
[1981,1982], Marsden [1988], Arms [1977,1981], Arms, Gotay, and Jennings 
[1990], Arms, Cushman, and Gotay [1991], Sjamaar and Lerman [1991], and 
Interlude III for more information on the meaning of item 7. These results 
are an outgrowth of the extensive literature on linearization stability, which 
concerns the success or failure of the linearization process as far as predicting the 
valid directions of perturbation theory, as well as that on symplectic reduction, 
which is the means of eliminating those degrees of freedom of a system that are 
associated with symmetries. 

Noether's theorem (aka the "first Noether theorem") and the formal Dirac- 
Bergmann analysis of constraints do much to predict and explain features 1-6. 
However, we wish to go further and provide (realistic) sufficient conditions which 
guarantee that they must occur in a given field theory. In this work we provide 
such criteria for the first six conditions and lay the groundwork needed for the 
analysis of the seventh. One of the goals is thus to derive the "adjoint formalism" 
for classical field theories. 

A key tool throughout this work is the "energy-momentum map," which we 
define in Part II. A recurrent theme in this work is that the energy-momentum 
map encodes essentially all the dynamical information carried by a given classi- 
cal field theory: its Hamiltonian, its initial value constraints, its gauge freedom, 
and even its stress-energy-momentum tensor. Indeed, in Part III we shall prove 

The Energy-Momentum Theorem. The constraints (|1.1|) are 
given by the vanishing of the energy-momentum map associated to 
the gauge group of the theory. 

Thus the constraint functions are "components" of the energy-momentum 
map. Item 4 then shows that the same is true for the Hamiltonian. 

Certain of these results are known in specific instances and have to some 
extent acquired "folk theorem" status. But we wish to prove that they hold 
rather generally. In this regard we emphasize that the energy-momentum map 
does more than merely provide reformulations of known results; it leads to 
fundamental insights into the foundations of physical theories. In view of their 
importance — and ubiquity — in classical field theory, one is tempted to elevate 
the statement "momentum maps are everything" to a general principle! 

To understand the role played by the energy-momentum map, recall the 
two traditional approaches to classical field theory, which we may characterize 
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as "group-theoretical" and "canonical." The group-theoretical approach is con- 
cerned with the gauge covariance of a given system, and is based upon Noether's 
theorem. It operates on the (spacetime) covariant level, and is usually phrased 
in Lagrangian terms. (See Trautman [1967], Olver [1986], and references con- 
tained therein for relevant background.) In contrast, the canonical approach is 
used to analyze the initial value problem in a Hamiltonian setting; the basic 
tool here is the Dirac-Bergmann theory of constraints. Such an analysis re- 
quires that the theory be space + time decomposed relative to a fixed choice of 
Cauchy surface in spacetime. (See Dirac [1964], Hanson, Regge, and Teitelboim 
[1976], Gotay, Nester, and Hinds [1978], Isenberg and Nester [1980], as well as 
the review by Sundermeyer [1982] for detailed accounts.) 

The group-theoretical and canonical aspects of a mechanical system are 
linked by the momentum map (see, for example, Abraham and Marsden [1978] 
and Marsden and Ratiu [1999]). One would like to have an analogous connec- 
tion in field theory which, in particular, relates gauge symmetries to initial value 
constraints. But — and this is a crucial point — the standard notion of a momen- 
tum map associated to a symplectic group action usually cannot be carried over 
to spacetime covariant field theory. The reason is that the gauge group typically 
does not act in the space + time decomposed (or "instantaneous") framework, 
because spacetime diffeomorphisms move Cauchy surfaces. Because the Hamil- 
tonian formalism is only defined relative to a fixed Cauchy surface, there is no 
possibility of obtaining a momentum map for the gauge group in the usual sense 
on the instantaneous phase space. 1 

The prime example where this situation arises is Einstein's theory of vac- 
uum gravity, in which case the gauge group is the spacetime diffcomorphism 
group. The only remnants of this group on the instantaneous level are, in the 
ADM (Arnowitt, Deser and Misner [1962]) language, the superhamiltonian Sj 
and supermomenta Z which are interpreted as the generators of temporal and 
spatial deformations of a Cauchy surface, respectively. But these deformations 
do not form a group, nor are f) and Z components of a momentum map. (See 
Kuchaf [1973] for further details.) More generally, this difficulty appears for any 
theory which is "parametrized" in that its gauge group "includes" the spacetime 
diffcomorphism group (in an appropriate sense). 

This circumstance forces us to work on the covariant level. But it is then 
necessary to construct a covariant counterpart to the instantaneous Hamilto- 
nian formalism. In the spacetime covariant (or "multisymplectic" ) framework 
we develop here — which is an extension and refinement of the formalism of Ki- 
jowski and Szczyrba [1976] — the gauge group does act. This enables us to define 
the notion of a covariant (or "multi-") momentum map on the corresponding 
covariant (or "multi-") phase space. This covariant momentum map satisfies 
Noether's theorem, and has all the properties one would expect. It remains to 
correlate this object with the dynamics once a Cauchy surface has been sin- 
gled out. To this end, a careful study of the mechanics of the space + time 

1 It is possible that a momentum map in the sense of groupoids may be appropriate for 
this setting. (See, for instance, Mikami and Weinstein [1988].) This is, however, a subject for 
future research. 
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decomposition shows that the covariant momentum map induces an "energy- 
momentum" map $ on the instantaneous phase space. It is not a genuine 
momentum map, as there is no group action in this context; nonetheless, $ is 
the instantaneous "shadow" of a covariant momentum map, and this is enough. 
The energy-momentum map is the crucial object that reflects the gauge trans- 
formation covariance of a classical field theory in the instantaneous formalism. 
In the ADM formulation of gravity, $ = — (Sj,Z), so that the super hamiltonian 
and supermomenta are the components of the energy-momentum map. 

The energy-momentum map thus synthesizes the group-theoretical and ca- 
nonical approaches to classical field theory. Consonant with this observation, 
we view the gauge group of a theory as being fundamental, and suppose it to 
be known ab initio. This is certainly the case in the standard examples cited 
previously. Our plan is then to use the energy-momentum map associated to 
the gauge group of the theory to study, and indeed derive, items 1-6 above. 
This shift in viewpoint proves to be surprisingly fruitful, enabling us to obtain 
deeper results than by the traditional methods alone. 

The theoretical underpinnings of the adjoint formalism for classical field 
theories consist of four components: 

I. a covariant analysis of field theories; 

II. a space + time decomposition of the covariant formalism followed by an 
initial value analysis of field theories; 

III. a study of the relations between gauge symmetries and initial value con- 
straints; and 

IV. the actual derivation of the adjoint formalism. 

The first two components constitute the foundation for the last two, where most 
of the hard work is concentrated. We present each of these four components as a 
paper in this series, starting with the covariant analysis. The fifth paper in this 
series applies our formalism to Einstein's theory of gravity. Before beginning 
Part I, we now comment briefly on each of the five parts, in turn. (For the 
convenience of the reader, we append a table of contents of Parts II-V at the 
end of this paper.) 

In Part I we develop some of the basic theory of classical fields from a 
spacetime covariant viewpoint. Throughout we restrict attention to first or- 
der theories; these are theories whose Lagrangians involve no higher than first 
derivatives of the fields. 

We begin in Chapter 2 of Part I with a study of the covariant Lagrangian 
and Hamiltonian formalisms, mirroring the approach usually taken in classical 
mechanics. Let X be an (n + l)-dimensional parameter space, which in appli- 
cations is usually spacetime. We suppose that we are given a bundle Y over 
X whose sections are the fields of interest; Y is the covariant configuration 
bundle. The field-theoretic analogue of the tangent bundle of mechanics is the 
first jet bundle J X Y of Y . By taking an appropriate dual of J X Y we obtain 
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the space Z which is, roughly speaking, the covariant cotangent bundle of the 
bundle Y. As such, it carries a canonical (n+l)-form 6. The multisymplectic 
(n + 2)-form ft = —d<d is the covariant generalization of the symplectic form in 
Hamiltonian mechanics. The pair (Z, f2) is called multiphase space. 

Next, in Chapter 3, we introduce a Lagrangian density £ on J Y Y . The 
covariant Lagrangian and Hamiltonian formalisms are then related by the co- 
variant Legendre transformation FL : J X Y — ► Z which is defined, as in 
mechanics, by means of the fiber derivative of L. Pulling the form £1 on Z 
back by FZ we obtain the Cartan form on J X Y (also variously known as the 
"Hamilton-Cartan" and "Poincare-Cartan" form). We use it to construct an 
intrinsic formulation of the variational calculus and the Euler-Lagrange equa- 
tions, cf. Goldschmidt and Sternberg [1973] and Garcia [1974]. 

Our approach to covariant field theory has a number of advantages over other 
formulations. For instance, unlike those of Kijowski and Tulczyjew [1979], Ra- 
gionieri and Ricci [1981], Giinther [1987], Sardanashvily [1993], and Kanatchikov 
[1998], our multisymplectic structure and corresponding covariant Hamiltonian 
formalism are intrinsically defined, independent of other constructs (such as a 
background connection). Other formalisms such as those of Goldschmidt and 
Sternberg [1973] and Sniatycki [1984], although intrinsic, do not carry multi- 
symplectic structures at all. The approach we take here is also promising for 
higher order field theories, which are plagued by a number of ambiguities; see 
Gotay [1991a, b] and references therein. Perhaps most importantly, our approach 
allows us to define covariant momentum maps for groups that act nontrivially 
on X. Such group actions also arise naturally in fluid dynamics (Moreau [1982]) 
and continuum mechanics (Lew, Marsdcn, Ortiz, and West [2002]) in relation 
to taking "horizontal variations" of the action. 

In Chapter 4 we discuss symmetries and conservation laws of classical field 
theories in terms of "covariant momentum maps," generalizing the concept of 
momentum map familiar from mechanics. If one introduces a Lie group 9 (pos- 
sibly infinite-dimensional) with Lie algebra g that acts on (Z, Q) in an appro- 
priate way, then one can define a multi- or covariant momentum map 
J : Z — > q* ® A n Z that intertwines the group action with the multisymplectic 
structure via the equation 

i Z A n = d(j,o, 

where (J, £) denotes J paired with £ £ g, £z is the infinitesimal generator on Z 
of the one-parameter group generated by £, and d is the exterior derivative. It 
often happens that such group actions are lifted from actions on Y, in which case 
one has an explicit formula for J; see §4C. Then in §4D we prove Noether's the- 
orem, which provides the basic relation between symmetries of the Lagrangian 
and conserved quantities. Our multisymplectic formulation of Noether's theo- 
rem is specifically designed with later applications to the relationship between 
constraints and the momentum map in mind. Here we also state and prove a 
converse to Noether's theorem, subject to a certain transitivity assumption on 
the group action. 
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All the discussion in Part I is space-time covariant. In Part II we use space 
+ time decompositions to reformulate classical field theories as infinite- 
dimensional dynamical systems. The transition from the multisymplectic to 
the instantaneous formalism once a Cauchy surface (or a slicing by Cauchy sur- 
faces) has been chosen is essential to define the energy-momentum map and 
thence to cast the field dynamics into adjoint form. 

Consider an n-dimensional Cauchy surface X C X (assumed compact and 
boundaryless). In the instantaneous context, the configuration space for field 
dynamics is the space of sections of Y restricted to X, and the phase space 
is T*"^y, with its canonical symplectic form. We emphasize that this symplectic 
structure lives on the space of Cauchy data for the evolution equations, and not 
on the space of solutions thereof (as in, e.g., Zuckerman [1987] and Crnkovic 
and Witten [1987]). This is an important distinction, as the former space is 
typically better behaved than the latter (cf. item 7 as well as the discussion in 
Horowitz [1989]). It also obviates the problem of constructing a differentiable 
structure on the space of solutions, as was attempted by Kijowski and Szczyrba 
[1976]. 

In Part II, the main result of Chapter 5 is that the multisymplectic structure 
on Z induces the canonical symplectic structure on T*^. To bridge the gap 
between the covariant and instantaneous formalisms, we introduce an interme- 
diate space Ze, the space of sections of Z restricted to X. The multisymplectic 
(n + 2)-form f2 on Z induces a presymplectic 2-form 51s on %s by integra- 
tion over X. Reducing Zs by the kernel of fis produces a symplectic manifold 
Zs/(kerHs), which we prove is canonically symplectomorphic to T*^- 

This shows how to space + time decompose covariant multisymplectic phase 
spaces into instantaneous symplectic phase spaces. Next, in §§6A-D, we per- 
form a similar decomposition of dynamics using the notion of slicings. An 
important result here states that the dynamics is compatible with the space + 
time decomposition in the following sense: Solutions of the instantaneous Hamil- 
ton equations correspond directly to solutions of the covariant Euler-Lagrange 
equations and vice versa. This fact — a prerequisite for establishing item 6 — is 
usually taken for granted in the literature. Here we provide a proof, assuming 
only mild regularity assumptions that are satisfied in cases of interest. 

An important tool throughout this work is the initial value analysis. One 
of the hallmarks of field theories is that not all of the Euler-Lagrange equations 
necessarily describe the temporal evolution of fields; some of them may impose 
constraints on the choice of initial data. The constraints that are first class 
reflect the gauge symmetry of the theory and eventually will be related to the 
vanishing of various (energy-) momentum maps. Thus, it is important to fully 
analyze the roles of constraints and gauge transformations in dynamics and the 
relations between them. To this end, we utilize the symplectic version of the 
Dirac-Bergmann treatment of degenerate Hamiltonian systems (patterned after 
Gotay, Nester, and Hinds [1978], cf. also Sundermeyer [1982]). This formalism, 
which we summarize in §6E, yields an essentially complete understanding of 
items 1—3, and lays the groundwork for correlating the first class constraints 
with the gauge group of the theory via the energy- momentum map. 



8 §1 Introduction 

In Chapter 7 we space + time decompose multimomentum maps associated 
to covariant group actions, thereby correlating these objects with momentum 
mappings (in the usual sense) in the instantaneous formalism. Thus, suppose 
that S is a group of automorphisms of Y. As emphasized earlier, 9 usually does 
not act on or on T*y% as it need not stabilize S. Nonetheless, by integrating 
over S, the covariant momentum map J gives rise to a map Zs — » 0*, which 
can then be shown to project to a well-defined map £s : — > g*, where the 
"primary constraint set" 3>s is the image of T^s in T*^^ under the instanta- 
neous Legendre transform. This map £s is the energy-momentum map. It 
is typically not a true momentum map. However, if one considers only the sub- 
group Ss of 9 that stabilizes the Cauchy surface (and that therefore acts in the 
instantaneous formalism), then £s restricts to a genuine momentum map for the 
9s-action on T*^s that corresponds in relativity to the supermomenta, without 
the inclusion of the supcrhamiltonian. In the general case £s corresponds to 
both Sj and 3- 

Parts I and II provide the requisite background for the study of items 4-6, 
which we carry out in Parts III and IV. Part III, in particular, focuses on estab- 
lishing a relationship between the initial value constraints on the one hand, and 
the energy-momentum map corresponding to the gauge transformations on the 
other. (Sec the Energy-Momentum Theorem.) We begin the study of this rela- 
tionship in Chapter 8 of Part III, in which we characterize the "gauge group" 
of a given classical field theory. Surprisingly, this notion, although familiar 
and intuitive, has never been precisely defined in any generality. It is — for the 
moment to be distinguished from the notion of "instantaneous gauge trans- 
formation" a la Dirac that was alluded to above. Of course, one of the main 
goals of Part III is to show that these two notions of "gauge" are in fact the 
same. This will follow once item 3 and the Energy-Momentum Theorem are 
established. 

The gauge group 9 is taken to be a subgroup of Aut(y), the group of 
automorphisms of the bundle Y. It must satisfy certain properties, viz.: 

(i) 9 acts by symmetries, for example, the Lagrangian density L is 9- 
equi variant. 

(ii) 9 is localizable, which intuitively means that elements of 9 can be "turned 
off" in the "future" or the "past." This property is crucial; it is what 
distinguishes a gauge group from a mere symmetry group. 

(iii) 9 is the largest such subgroup of Aut(F). 

In §8 A we give a principal bundle construction of 9- 

One should bear in mind, however, that some field theories may have no 
gauge freedom (or initial value constraints) at all; the Klein-Gordon field on 
a nonintcracting background spacetime is a simple example of such a system. 
Theories like this can be subsumed into our formalism by requiring them to be 
parametrized, in the sense that 9 maps onto Diff(A) (or at least a localizable 
subgroup thereof) under the natural projection Aut(Y) — > Diff(A). Thus, the 
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gauge groups of parametrized theories are always nontrivial. Many field theories 
are "already" parametrized (e.g., the Polyakov string, topological field theories, 
gravity). In any case, this requirement involves no loss of generality, since a 
theory that is not parametrized can often be made so, for example, by simply 
introducing a metric on X and treating it as an auxiliary variable (which may 
or may not be variational in the sense that it is to be varied in the action 
principle to obtain Euler-Lagrange equations) , or else by coupling it to gravity. 
Kuchaf [1973] gives an alternate way of parametrizing a theory. Henceforth we 
assume that the systems under consideration are parametrized. (Although our 
formalism is best suited to parametrized theories, much of it — with appropriate 
modifications will apply to "background" theories as well. These points are 
discussed further in the text; see also Interlude I.) 

We start building towards the main results of Part III by establishing the 
following result, which is basic to the entire development. Fix a hypersurface 
£ and an element £ g g with the property that its associated spacetime vector 
field £x is transverse to S. (Such a vector field exists since the theory is param- 
etrized.) The Lie algebra element £ may be thought of as defining an "evolution 
direction"; it is analogous to the "lapse" and "shift" in relativity. In §7F we 
show that the instantaneous Hamiltonian density H-^q on CPs corresponding to 
the evolution direction £ S g is given by 

ff E ,c = -(£ s ,0- (i-4) 

Thus the instantaneous Hamiltonian is a component of £s (hence the appella- 
tion energy-momentum map). It follows that £s actually generates the dynam- 
ics of the system. This result is familiar in the ADM formulation of gravity: 
using the fact that £ s = -(f), 3) and setting ( = (N, M) e X(X), reduces 
to 

ffs,(iV,M) = Nfj + M ■ 3- 
Equation l|1.4|l forms the basis for item 4; in this regard, the fact that -ffs.c is 
manifestly linear in £ presages its linearity in the atlas fields, discussed later for 
general field theories. 

Next, in Chapter 9, we prove the Vanishing Theorem (aka the "second 
Noether theorem"), which states that on any solution of the Euler-Lagrange 
equations, the multimomentum map integrated over a hypersurface is zero. This 
result follows from Noether's theorem, S-equivariance, and localizability. The 
Vanishing Theorem implies that the energy-momentum map must vanish on 
admissible Cauchy data for the evolution equations. (The fact that £s must 
vanish, as opposed to merely being constant, is a direct consequence of localiz- 
ability.) 

With these results in hand, in Chapter 10 we establish the Energy- Momen- 
tum Theorem, which asserts that for any field theory which is first class and 
satisfies certain technical conditions, the set Cs of admissible Cauchy data for 
the evolution equations is exactly the zero level set of the energy-momentum 
map associated to the gauge group: 

C s = £ s 1 (0). (1.5) 
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The set Cs is known as the "final constraint set" in Dirac's terminology. More 
succinctly, this theorem says that the initial value constraints are given by the 
vanishing of the energy-momentum map. Although often quoted in the liter- 
ature, to our knowledge no proof of this fact has ever been given. It may 
at first appear that ljl.5|l is "just" the Vanishing Theorem, but this is not the 
case. Certainly the Vanishing Theorem implies that the vanishing of the energy- 
momentum map yields initial value constraints, which can then be shown to be 
first class. But it is far from obvious that in fact all such constraints arise in 
this fashion. Moreover, the proper context for the second Noether theorem is 
the multisymplectic formalism described above. Only in this setting is one able 
to obtain constraints that are associated to gauge transformations that move 
Cauchy surfaces in spacetime, such as the superhamiltonian constraint in general 
relativity. It should be observed that other covariant Hamiltonian formalisms 
are deficient in this regard, for example, that of Kijowski and Tulczyjew [1979]. 

One corollary of the Energy-Momentum Theorem is that since 3^0, first 
class constraints are always present in parametrized theories (in which all fields 
are variational). Thus, with this proviso, items 1—3 are never vacuous. Another 
corollary is that the covariant and instantaneous notions of gauge coincide; more 
precisely, when space + time decomposed, the gauge group generates exactly 
the instantaneous gauge transformations in the sense of the Dirac-Bergmann 
constraint theory. This indicates that the above definition of S is "correct." 

There are calculational advantages that accrue from this theorem as well. 
For instance, one can always compute the initial value constraints of a given 
field theory according to the Dirac-Bergmann procedure. But the ensuing cal- 
culations can be quite involved, and the results are not always easy to interpret. 
An analysis based on the gauge group and the energy-momentum map, on the 
other hand, is often substantially simpler — and has the advantage of attaching 
a group-theoretical interpretation to the constraints. A case in point is again 
provided by ADM gravity. If one merely carries out the Dirac-Bergmann analy- 
sis, one obtains the superhamiltonian and supermomentum constraints, but it is 
entirely unclear from this analysis what their geometric significance is. Only by 
"hindsight" is one able to correlate these constraints with temporal and spatial 
deformations of a Cauchy surface, reflecting the 4-dimensional diffeomorphism 
gauge freedom of the theory. The problem is that the spacetime diffeomorphism 
group is "hidden" once one performs a space + time decomposition — a neces- 
sary prerequisite to the initial value analysis. But our procedure yields these 
constraints rather routinely, and their geometric meaning is clear from the be- 
ginning. (See Part V for the application of our techniques to gravity.) Thus, 
one has a method for identifying the final constraint set that does not in prin- 
ciple require one to invoke the entire constraint preservation algorithm. The 
criteria which must be checked are not always easy to verify but, in any case, 
the theorems of Chapter 10 do guarantee an equivalence between the results 
of the Dirac-Bergmann algorithm and the results of an analysis based on the 
energy-momentum map of a given gauge group action. 

The Energy-Momentum Theorem is really a statement about (first class) 
secondary constraints, as £s is only defined on the primary constraint set 
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IPs C In Chapter 11 we show that (first class) primary constraints 

can be recovered in much the same way, but the proof is more difficult and 
requires different techniques. For the primaries the relevant object is not S but 
rather a certain foliation Se of derived from S; intuitively, Ss consists of 
the "time derivatives" of elements of the subgroup Ss which stabilizes S. This 
foliation has a "momentum map" 3s and, provided zero is a regular value, the 
primary constraints are obtained by equating 3n to zero. It is possible — with 
some additional assumptions on the nature of the gauge group — to realize this 
foliation as the orbits of a genuine subgroup of Ss; this is the subject of §12B. 

We have thus far seen that both the Hamiltonian and the initial value con- 
straints are "components" of the energy-momentum map. In Interlude II we 
show, following Gotay and Marsden [1992], that the stress-energy-momentum 
tensor of a given collection of fields arises in a similar manner, again demon- 
strating the capacity of the multimomentum map to unify disparate concepts 
into a single geometric entity. The stress-energy-momentum tensor so ob- 
tained satisfies a generalized version of the classical "Bclinfantc Roscnfcld for- 
mula" (Bclinfante [1940], Roscnfcld [1940]), and hence naturally incorporates 
the canonical stress-energy-momentum tensor and the "correction terms" that 
are necessary to make the latter gauge-covariant. Furthermore, in the presence 
of a metric on X, our coincides with the Hilbert stress-energy-momentum 
tensor and hence is automatically symmetric. 

Part IV derives the adjoint formalism for field theories. This is the capstone 
of our work, where all the various aspects of a classical field theory meld into 
a coherent whole. The adjoint formalism is the starting point for investigations 
into the structure of the space of solutions of the Euler-Lagrange equations, 
linearization stability, quantization, and related questions. 

To accomplish this objective, in Chapter 12 we distinguish two types of field 
variables that play an especially important role in the initial value analysis. The 
dynamic fields are those whose evolution from given initial data is determined 
by well-posed evolutionary equations. These are the fields that carry the dynam- 
ical content of the theory and with which the adjoint formalism is concerned. 
By way of contrast, the evolution of the kinematic fields is determined not 
by the Euler-Lagrange equations, but rather by the way the spacetime X and 
the covariant configuration bundle Y are sliced. In fact, the kinematic fields are 
closely related to the first class primary constraints and their existence can be 
directly attributed to the presence of gauge symmetries. 

The space of dynamic fields is Ds = ^s/Ss- The kinematic fields are 
abstractly realized as sections of this quotient; this definition makes sense as 
the primary constraints are components of the momentum map associated to 
the foliation Ss, so their canonically conjugate fields are parametrized by Ss- 
By a slight generalization of the cotangent bundle reduction theorem (Abraham 
and Marsden [1978], Kummcr [1981], Marsden [1992]), the instantaneous phase 
space T*^y, can then be cut down to the space T*D^ of dynamic fields -0 and 
their conjugate momenta p. (Physically, this reduction consists of imposing the 
primary constraints and then factoring out the kinematic fields.) 
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The goal is to project the dynamics on the primary constraint set in T*^y. 
to parametric dynamics on T*T>y,- The parameters will be certain combinations 
of the kinematic fields and elements of g which we call "atlas fields." 

Unfortunately, the energy-momentum map £s on 7s does not directly pro- 
ject to T*De, as it involves the kinematic fields. But £s, thought of as a map 
7y. x — > R, also depends upon ( £ 5. By algebraically lumping the kine- 
matic fields together with elements of the Lie algebra of the gauge group in a 
"Ss-equi variant" fashion, one forms the nondynamic atlas fields on. Their 
abstract definition turns out to be rather delicate, and necessitates some tech- 
nical assumptions regarding the action of the gauge group on Y . In any case, 
the atlas fields are constructed in such a way that £s depends only upon (ip, p) 
and the a,;. The energy-momentum map then can be pushed down to a func- 
tion $5] on T*T>y, depending upon the parameters on. This function <5>s is the 
"reduced" energy-momentum map. We observe that the Energy-Momentum 
Theorem remains valid in this context, since the secondary constraints restrict 
only (ip,p) — the kinematic fields being correlated with the primary constraints. 
Combining these results with (|1.4|) then leads to item 4. 

The essential content of item 4 is not so much that the Hamiltonian can be 
expressed in terms of the constraints, as noteworthy as that is. (Indeed, this 
is already apparent from Ql-4j .) Rather, it is the fact that the Hamiltonian 
is linear in the atlas fields. This is vital for the adjoint form of the 

evolution equations, which now follows readily from (|1.2I) by writing Hamilton's 
equations explicitly on T*T>s. 

The salient features of the adjoint form Ijl.^p are: 

• The constraints generate the dynamics, via the appearance of the energy- 
momentum map in l|1.3fl . 

• The atlas fields "drive" the entire gauge ambiguity of the theory. This is 
a remarkable result, especially when viewed from the standpoint of Dirac 
constraint theory: no matter how complex the cascade of constraints, 
the gauge freedom generated by the totality of first class constraints is 
encapsulated in the fields aj (which are closely linked to the first class 
primaries). 

• The gauge freedom of the theory is completely subsumed in the choice 
of slicing. This is convenient computationally, as the form of the adjoint 
equations l|1.3fl is ideally suited for studying evolution relative to various 
slicings. 

• The dynamics is independent of the choice of slicing, which is a conse- 
quence of the preceding remark. 

These statements are familiar in the case of the ADM formulation of gravity, 
but our results are more general and include theories which have "internal" (i.e., 
not spacetime-based) gauge freedom, such as Yang-Mills theory. 

The independence of the adjoint formalism on the choice of slicing together 
with the results of Chapter 6 give item 6. The adjoint formalism is now ready 
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to be used for such purposes as the singularity analysis of the solution space, as 
we have mentioned. 

The results of this work apply to a large class of systems including most 
field theories of current interest. To bolster this claim, throughout the work 
we provide a number of illustrations of the formalism as we develop it. These 
examples include particle mechanics, Maxwell's theory of electromagnetism, a 
topological field theory, and bosonic string theory. In Part V we present (the 
Palatini variant of) Einstein's theory of gravity as a stand-alone example. 

All these examples have first order Lagrangians and, except for Palatini 
gravity and the topological field theory, all of them involve only constraints 
which are first class in the sense of Dirac. Indeed, many of the theorems as stated 
in this work are restricted to field theories with these properties. However, we do 
believe that the essential thrust of this work holds for more general classical field 
theories. Second class constraints, for example, should not cause trouble. The 
main effect of the presence of second class constraints is the failure of equality in 
(jl.5|l : we are only guaranteed that Cs C £^ 1 (0)- As a consequence, one can no 
longer compute the final constraint set using the energy-momentum map alone. 
However, the strong tie between first class constraints and gauge transformations 
should remain, and one should still be able to provide theoretical support for 
items 1-7, including the adjoint form l|1.3[) . for such theories. Note that one way 
to deal with second class constraints is to eliminate them via a Dirac bracket 
construction (Sniatycki [1974]); while this presents no problem in principle, it 
is sometimes unsatisfactory from a computational standpoint. Another way 
to handle second class constraints might be based on the work of Lusanna 
[1991,1993], which relates them to transformations that leave the Lagrangian 
gwosj-invariant. Note also that in well-known field theories with second class 
constraints — for example, the Einstein-Dirac theory (see Bao, Isenberg, and 
Yasskin [1985], and Bao [1984]) and the KdV equation (see Gotay [1988])— 
those constraints do not introduce undue difficulties. 

We also expect — modulo some technical difficulties — that the formalism de- 
veloped here can be straightforwardly applied to higher order field theories. 
Such a generalization is necessary in particular to directly treat Einstein's the- 
ory of gravity, since the Hilbert Lagrangian is second order in the metric. In 
fact, the multisymplectic setup and its space + time decomposition have already 
been extended to field theories of any order in Gotay [1988, 1991a, b]; see also 
Kouranbaeva and Shkoller [2000]. However, for third or higher order systems 
the Cartan form is not uniquely determined by the Lagrangian (Kolaf [1984]). 
This circumstance bears on an important fact which underlies our analysis; viz. 
for first (or second) order theories, a symmetry of the Lagrangian is also a sym- 
metry of the Cartan form. (This is essential for Noether's theorem among other 
things.) Thus in the higher order case it is necessary to determine whether there 
exist S-invariant Cartan forms. The work of Muhoz [1985] may be helpful in 
this context. 

We believe that one further generalization should hold. Namely, the prin- 
cipal ideas of this work should apply to cases in which one is naturally led to 
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consider a Poisson, as opposed to a symplectic structure, such as in certain 
formulations of general relativistic perfect fluids and plasmas (cf. Bao, Mars- 
den, and Walton [1985] and references therein). It could be useful to develop 
a covariant analogue of Poisson manifolds (perhaps along the lines of Marsden, 
Montgomery, Morrison, and Thompson [1986]) and to develop a version of co- 
variant reduction that could start with the multisymplectic ideas of this work 
and produce Poisson structures as in the nonrelativistic case. (See, for example, 
Marsden, Weinstein, Ratiu, Schmid, and Spencer [1983] and Marsden [1992] for 
a summary of the nonrelativistic reduction theory, and Holm [1985] for indica- 
tions of how the relativistic theory should proceed.) Recent results on reduction 
for field theories can be found in Castrillon and Marsden [2002] . 

Finally, we remark that multisymplectic geometry may be used for other 
interesting purposes besides the investigation of the intrinsic structure of clas- 
sical field theories. In particular, Marsden and Shkollcr [1999] and Marsden, 
Patrick, and Shkollcr [1998] have shown that Bridges' [1997] treatment of water 
waves and their instabilities may be understood in multisymplectic terms and 
that one can build numerical integrators using these ideas. One can also do 
continuum mechanics from the multisymplectic point of view as in Marsden, 
Pekarsky, Shkollcr, and West [2001]; this leads to a natural way of incorporat- 
ing 'constraints' such as incompressibility. Covariant techniques are also proving 
useful in understanding the dynamical structure of the gravitational field; see 
Ashtekar, Bombelli, and Reula [1991] and Anco and Tung [2002] for some results 
in this direction. 
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I — C o variant Field Theory 



In this part we develop some of the basic theory of classical fields from a co- 
variant viewpoint. This is done in the framework of multisymplectic geometry, 
which is analogous to the geometry of the cotangent bundle in classical non- 
relativistic mechanics — which we often call particle mechanics (even though it 
also includes continuum mechanics). We also develop the geometry of the jet 
bundle and the Euler-Lagrange equations, which is analogous to the geometry 
of the tangent bundle and the Lagrange equations of motion in the case of par- 
ticle mechanics. The final chapter of this part discusses conservation laws and 
Noether's theorem from the point of view of covariant momentum maps, again 
generalizing the concept of momentum map familiar from particle mechanics. 



2 Multisymplectic Manifolds 

There are two major approaches to the symplectic description of classical field 
theory, the covariant (or "multisymplectic") formalism and the instantaneous 
(or "3+1") framework. In the latter approach dynamics is described in terms 
of the m/imie-dimensional space of fields at a given instant of time, whereas in 
the former dynamics is phrased in the context of the /zmie-dimensional space 
of fields at a given event in spacetimc. Each formalism has its own advantages. 

In this and the next several sections, we summarize those aspects of the mul- 
tisymplectic formalism and its relation to the 3+1 framework that will be impor- 
tant for subsequent developments. There are many recent references for the mul- 
tisymplectic formalism, such as Dedecker [1953,1977], Gawedzki [1972], Gold- 
schmidt and Sternberg [1973] (see also Guillemin and Sternberg [1977,1984]), 
Kijowski [1973,1974], Garcia [1974], Kijowski and Szczyrba [1975,1976], Ki- 
jowski and Tulczyjew [1979], Aldaya and de Azcarraga [1980a], Kupcrshmidt 
[1980], Kosmann-Schwarzbach [1981], Ragionieri and Ricci [1981], Kastrup 
[1983], Sniatycki [1970a, 1984], de Leon and Rodrigues [1985], Gimther [1987], 
Gotay [1991a, b], Sardanashvily [1993], Kanatchikov [1998], Marsden, Patrick, 
and Shkoller [1998], and Castrillon and Marsden [2002]. Historically, much 
credit should also be given to Cartan [1922], De Dondcr [1930], Weyl [1935] 
and Lepage [1936]. Our presentation here is self-contained, but not exhaustive; 
topics not needed for our main results as well as alternative approaches have 
been omitted. 

Throughout this work, all manifolds and maps are assumed to be of class C°° . 
Our conventions follow Misner, Thornc, and Wheeler [1973] to a large extent. 
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2A The Jet Bundle 

Let X be an oriented manifold, which in many examples is spacetime, and 
let ttxy ■ Y — * X be a finite-dimensional fiber bundle called the covariant 
configuration bundle. The fiber ir x \r{x) of Y over x S X is denoted Y x 
and the tangent space to X at x is denoted T X X, etc. The physical fields will 
be sections of this bundle, which is the covariant analogue of the configuration 
space in classical mechanics. 

We shall develop classical field theory in parallel to classical mechanics. First 
we discuss the field-theoretic analogues of the tangent and cotangent bundles. 
The role of the tangent bundle is played by J X Y, the first jet bundle of Y. 
We recall its definition. We say that two local section <fii, <f>2 of Y agree to 
first order at x G X if each is defined in some neighborhood of x and if their 
first order Taylor expansions are equal at x (in any coordinate system). In 
other words, they agree to first order if and only they have the same target 
y = <f>i(x) = 4>2{x) and if their linearizations T x (f>i and T x <f>2 at x coincide 
as linear maps T X X — ► T y Y . Agreement to first order defines an equivalence 
relation on the set of all local sections; the resulting set of equivalence classes 
is the first jet bundle J Y Y . Saunders [1989] contains much information on jet 
bundles. 

It is convenient to use the natural identification of J l Y with the affine 
bundle over Y whose fiber above y € Y x consists of those linear mappings 
7 : T X X — > T y Y satisfying 

Titxy ° 7 = Identity on T X X. (2A.1) 

The map 7 corresponds to T x <p for a local section <j>. The vector bundle un- 
derlying this affine bundle is the bundle whose fiber over y G Y x is the space 
L(T X X, V y Y) of linear maps of T X X to V y Y, where 

V y Y = {ve TyY I Tttxy -v = 0} (2A.2) 

is the fiber above y of the vertical subbundle VY C TY. Note that for 
7 € J^y, we have the splitting 

T y Y = im 1 ®V y Y. (2A.3) 



Remark 1. One can view sections of J X Y over Y as Ehresmann connections 
on Y. (See Hermann [1975] or Marsden, Montgomery, and Ratiu [1990] for a 
brief review of Ehresmann connections.) The horizontal- vertical splitting l|2A.3(l 
defines the connection associated to such a section. If Y is a principal G-bundle, 
then J X Y — > Y is the bundle whose equivariant sections are connections in the 
usual sense of principal bundles. 2 ♦ 

2 In this case J X Y, as a principal bundle over the connection bundle CY = J 1 Y/G, itself 
has a canonical connection. For the bundle Q X R — » Q regarded as an R-bundle, J 1 (Q X R) 
is isomorphic to the bundle T*Q X R and the canonical connection is the usual contact one- 
form 9. The curvature of this connection is the canonical symplectic form d£>, up to a sign 
depending on conventions. 
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In Chapter 3 we shall study Lagrangians on J X Y and the attendant Euler- 
Lagrange equations, etc., by a procedure parallel to that used for Lagrangians 
on the tangent bundle of a configuration manifold. Our choice of J X Y as the 
field-theoretic tangent bundle reflects the fact that all of the theories we shall 
consider have Lagrangians which depend at most on the fields and their first 
derivatives. For higher order field theories, the appropriate objects to study are 
the higher order jet bundles; see, for example, Aldaya and Azcarraga [1980b], 
Shadwick [1982], Ferraris and Francaviglia [1983], Krupka [1987], Horak and 
Kolaf [1983], de Leon and Rodrigues [1985], Gotay [1991a, b], and references 
therein. 

We let dim A = n + 1 and the fiber dimension of Y be N. Coordinates on 
A are denoted x^, fj, = 0, 1, 2, . . . , n, and fiber coordinates on Y are denoted 
by y A , A = 1, . . . , N. These induce coordinates v A M on the fibers of J X Y . If 
<j> : X — ► Y is a section of ttxy, its tangent map T x <p at x € X is an element of 
Jj,( x )Y. Thus, the map x T x (f> is a section of J X Y regarded as a bundle over 
X. This section is denoted j x (j) and is called the first jet prolongation of (f>. 
In coordinates, j x <p is given by 

x» ^ (jc", <t> A (x») , d v cf) A (x^)) , (2A.4) 

where <9„ = d/dx v . A section of the bundle J X Y — > X which is the first jet of 
a section of Y — > X is said to be holonomic. In this work we shall use global 
sections, although at certain points in the arguments one should really work 
with local sections. It will be clear in each instance what is meant. 



Examples 

a Particle Mechanics. For non-relativistic classical mechanics with config- 
uration space Q, let X = R (parameter time) and Y = R x Q, with ttxy the 
projection onto the first factor. The first jet bundle J X Y is the bundle whose 
holonomic sections are tangents of sections <j) '■ A — > Y (for example, curves in 
Q), so we can identify J X Y = R x TQ. (Note however that J Y Y is an affine 
bundle, while M x TQ has a vector bundle structure). Using coordinates (t, q A ) 
on R x Q, the induced coordinates on J X Y are the usual tangent coordinates 
(t,q A ,v A ). 

In the case of a relativistic particle we would take Q to be the physical 
spacetime equipped with a Lorentzian metric g. (In this context, A = 0, 1,2,3 
is now a spacetime index.) 

b Electromagnetism. For electromagnetism on a fixed background (so that 
the spacetime metric is "frozen" ) , X is four-dimensional spacetime and Y is the 
bundle A 1 A of one-form potentials A. Coordinates for J X Y are (x^, A u , v vll ), 
where the derivative index \x is placed at the end: 

v u^ ° A = d^A u = A v ^. 
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The one-forms A may be thought of as connections on a principal {/(l)-bundle. 
However, here we treat electromagnetism as an ordinary one-form field theory. 
Similarly, one could treat Yang-Mills fields with all the results of the paper 
holding, by taking Y to be the connection bundle over spacetime (i.e., the 
bundle whose sections are connections of a given principal bundle) . 

We also consider the case when the spacetime metric g is variable (but still 
not variational), so that the theory is parametrized. (See Interlude I. If we 
wanted the spacetime metric to be variational, we could couple electromag- 
netism to gravity.) To accomplish this, we merely append g to the other field 
variables as a parameter, replacing Y and J X Y by Y = Y Xx S^^iX) and 
J X Y x x S 2 ' {X), respectively, where S 2 ' {X) is the bundle of symmetric covari- 
ant symmetric two-tensors of Lorentz signature (—,+,+,+). The coordinates 
on J X Y x x S^^iX) are now (a;'*, A v , v vp/ ; g ap ), where we use a semicolon to 
separate variational fields from parametric ones. Note, in particular, that there 
are no "multivclocities" v apfl corresponding to the g ap . 

c A Topological Field Theory. Examples b and d following are both 
"metric" field theories, that is, theories which carry a metric (of some signature, 
usually Lorentzian) on the parameter space X. Here we give a simple example of 
a non-metric field theory: the (abelian) Chern-Simons theory. Topological 
field theories — of which the Chern-Simons theory is perhaps the best-known 
illustration — behave rather differently than metric theories, and exhibit a variety 
of interesting phenomena. We refer the reader to Horowitz [1989] for a fuller 
discussion of such theories. 

Superficially, the Chern-Simons theory resembles electromagnetism. We 
take X to be a 3-manifold and Y — A X X. Coordinates on J X Y are then 
(x M , A v , v vp ). As with electromagnetism, one could view A as a connection 
on some principal [/(l)-bundle over X. One could also consider non-abelian 
generalizations, where U(l) is replaced by an arbitrary Lie group. But the key 
difference here is that we do not introduce a metric on X. 

d Bosonic Strings. Let X be a 2-dimensional manifold with coordinates 
x M for fi = 1,0. Also, let (M,g) be a (d+ l)-dimensional spacetime manifold M 
with coordinates <fi A for A = 0, 1, . . . , d and a fixed metric g of Lorentz signature 
(— ,+, ...,+). A bosonic string is a map <p ■ X — > M. Equivalcntly, its graph 
is a section of the product bundle Y = X x M over X. To write down the 
Lagrangian, we will also need a metric (or at least a conformal metric) on X. 
Thus, let S^iX) denote the bundle over X of symmetric covariant rank two 
tensors of Lorentz signature (—,+). Then a section of S 2 ' (X) is a metric h on 
X. 

There are (at least) three approaches to classical bosonic string theory which 
differ in the choice of independent variables and the Lagrangian. In the first 
(due to Polyakov [1981]), <p an d h are taken as independent variables and the 
Lagrangian is taken as the harmonic map Lagrangian. In the second (due to 
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Nambu [1970]), (j> is taken as the independent variable, while h is taken to be 
the induced metric 

K p (x) = (0*. 9 ) CTp (x) = 9A B{<i>{x)) d^ A d p <f. (2A.5) 

Plugging these into the harmonic map Lagrangian, it reduces to the minimal 
surface (i.e., area) Lagrangian. Between these two extremes is the third version 
which takes <f> and the conformal part of h as the independent variables. Since 
the harmonic map Lagrangian is conformally invariant, it may be reexpressed 
in terms of these variables. For simplicity, we shall only consider the Polyakov 
model in this paper since it treats the metric h variationally. This model has 
also been treated by Batlle, Gomis and Pons [1986]; however they assume that 
g is flat whereas our g is not necessarily flat. Our analysis of the bosonic string 
should also be compared with that of Beig [1989]. One can analyze the Nambu 
model in a similar way; see, for example, Scherk [1975]. 

With (j) and h chosen as the independent variables, the covariant configura- 
tion bundle is Y = (X x M) x x S^' 1 (X) and 

J Y Y = J 1 (X x M) x x J 1 {S^ 1 {X)). 

The coordinates on J X Y are then (x^,<f) A 7 h ap7 v A fl , w apil ). 



Remark 2. Concerted efforts to understand Einstein's theory of gravity have 
provided much of the motivation and insight for this work. It is therefore per- 
haps surprising that the (vacuum) gravitational field is a somewhat exceptional 
case. Here X is again spacetime and the natural choice for Y is S 2 ' {X). With 
this choice, however, one is forced to work on the second jet bundle J 2 Y since the 
Hilbert Lagrangian depends upon the second derivatives of the metric. To avoid 
this, one may adopt the Palatini approach, using the metric and the torsion-free 
connection as the primary fields. In terms of these variables, the Hilbert La- 
grangian is first order, and so the analysis may be done on the first jet bundle. 
The price we must pay for this simplification is that the Palatini formalism is 
plagued by a plethora of second class constraints. Sniatycki [1970b], Szczyrba 
[1976b], and Binz, Sniatycki, and Fischer [1988] have studied this approach and 
it is the one we shall follow. An alternative approach has been developed by 
Ashtekar [1986, 1987] using another set of canonical variables. It would be of in- 
terest to develop the formalism herein for this formulation of gravity; first steps 
in this direction have been taken by Esposito, Gionti, and Stornaiolo [1995]. 

Because of its length and intricacy, we defer the analysis of Palatini gravity 
until Part V. In the meantime, we observe that bosonic strings mimic gravity 
to a significant extent, while having the virtue of being a first order theory with 
only first class constraints. ♦ 
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2B The Dual Jet Bundle 

Next we introduce the field-theoretic analogue of the cotangent bundle. We 
define the dual jet bundle J X Y* to be the vector bundle over Y whose fiber at 
y G Y x is the set of affine maps from JyY to A" +1 AT, where A n+1 X denotes the 
bundle of (n + l)-forms on X. 3 A smooth section of J Y Y* is therefore an affine 
bundle map of J X Y to A n+1 X covering ttxy- We choose affine maps since J X Y 
is an affine bundle, and we map into A n+1 X since we are ultimately thinking of 
integration as providing the pairing on sections. 

Fiber coordinates on J X Y* arc {p 1 pA tl ) 1 which correspond to the affine map 
given in coordinates by 

A (p + PA^u) d n+1 x (2B.1) 

where 

d n+1 x = dx° A dx 1 A • • • A dx n . 

Analogous to the canonical one- and two-forms on a cotangent bundle, there 
are canonical forms on J X Y* . To define these, another description of J X Y* will 
be convenient. Namely, let A := A n+1 Y denote the bundle of (n + l)-forms on 
Y, with fiber over y £7 denoted by A y and with projection ttya ■ A — > Y. Let 
Zc Abe the subbundle whose fiber is given by 

Z y = {z e A y | i v i w z = for all v, w e V y Y} , (2B.2) 

where i„ denotes left interior multiplication by v. 
Elements of Z can be be written uniquely as 

z = p d n+1 x + pA^dy'^ A d% (2B.3) 

where 

d n x^ = d„J d n+1 x. 

Here J also denotes the interior product. Hence fiber coordinates for Z are also 

Equating the coordinates (x^,y A ,p,pA^) of Z and of J X Y* defines a vector 
bundle isomorphism 

$ : z -» j x y*. 

Intrinsically, <I> is defined by 

^{z), 1 )^ 1 *zeA n + 1 X (2B.4) 

where z € Z y , 7 G ^J^, ^ = fxyft/), and (•, •) denotes the dual pairing. To see 
this, note that if 7 has fiber coordinates v A ^, then 

^dx^^dx 11 and j*dy A =v A ^dx" (2B.5) 

3 The space of affine maps from an affine space into a linear space forms a vector space. 
Hence J 1 Y* is a vector bundle, despite the fact that J 1 Y is only an affine bundle. 
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and so 

7* (p d" +1 a; + p A fl dy A A d%) = (p + Pa' 1 ^) d n+1 x, (2B.6) 
where we have used 

da;" A d% = 81 d n+1 x. 

The inverse of $ can also be defined intrinsically, although it is somewhat 
more complicated. Notice that a connection is not needed to define this isomor- 
phism. 

We summarize: 

Proposition 2.1. The spaces J l Y* and Z are canonically isomorphic as vector 
bundles over Y . 

We shall construct canonical forms on Z and then use the isomorphism 
between J X Y* and Z to transfer these to J 1 Y* . We first define the canonical 
(n + l)-form 9 a on A by 

Q\(z) (ui, . . . ,u n+ i) = z (Tir Y A ,Tir Y h ■ u n+ i) 

= {TTp A z)(u 1 ,...,u n+1 ), (2B.7) 

where z £ A and u%, . . . , it n +i S T Z A. Define the canonical (n + 2)- form tt\ 
on A by 

Q A = -d6 A . (2B.8) 

Note that if n = (i.e., AT is one-dimensional), then A = T*Y and 9a is 
the standard canonical one-form. If i\z ■ Z — » A denotes the inclusion, the 
canonical (n + I)- form 9 on Z is defined by 

9 = i* Az e A (2B.9) 

and the canonical (n + 2)- form il on Z is defined by 

n = -de = i* AZ n A . (2B.10) 

The pair (Z, Q) is called multiphase space or covariant phase space. It is 
an example of a multisymplectic manifold; see Remark 3 below. 

From ||2E| and f2B~7jl - (j2B.10jl . one finds that the coordinate expression 
for 9 is 

e= PA fi dy A Ad n Xfl +pd n+1 x, (2B.11) 

and so 

Q = dy A A A d% — dp A d n+1 x. (2B.12) 

Remarks 1. There are other possible choices of multiphase space (cf. Eche- 
verria-Enriques, Muhoz-Lecanda, and Roman-Roy [2000] and the references 
cited at the beginning of this chapter), but this one is the most relevant and ef- 
ficient for our purposes. There are good reasons for this choice: it is natural, as 
shown in Proposition 2.1, and it carries a canonical multisymplectic structure. 
We refer to Gotay [1991a, b] for more on why one makes this particular choice 
of multiphase space. 
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2. A notable feature of the covariant formalism is the appearance of several 
momenta pa^, spatial in addition to temporal, for each field component y A . The 
Legendre transformation of Chapter 3 will provide a physical interpretation of 
these "multimomenta" as well as of the "covariant Hamiltonian" p. As we shall 
see, the afhne nature of our constructions provides a unification of the two parts 
of the Legendre transformation, namely the definitions of the momenta and the 
Hamiltonian, into a single "covariant" entity. 

3. Whether or not there is a proper abstract definition of "multisymplectic 
manifold" is open to question. As a first try, we could say that a multisym- 
plectic manifold is a manifold Z endowed with a closed fc-form f2 which is 
nondegenerate in the sense that iyfi ^ whenever V is a nonzero tangent vec- 
tor. This would include our case, symplectic manifolds (k = 2), and manifolds 
with a volume form (fc = dimZ). But for all cases intermediate between sym- 
plectic and volume structures there is no Darboux theorem. Indeed, the set of 
germs of such fc-multisymplectic forms for 2 < k < dim Z has functional mod- 
uli: there are local differential invariants of such a form, distinguishing it from 
arbitrarily nearby multisymplectic forms. Such a theory would thus require ex- 
tra structure, perhaps mimicking the fibration information contained in our Z, 
or else would proceed in a manner having very little in common with modern 
symplectic geometry where the Darboux theorem is such a central organizing 
fact. For attempts at such a "multisymplectic geometry" see Martin [1988] or 
Cantrijn, Ibort, and de Leon [1999]. ♦ 

We next recall that there is a characterization of the canonical one-form 9 
on T*Q by (3*9 = (3 for any one-form (3 on Q (Abraham and Marsden [1978], 
Proposition 3.2.11). Here is an analogue of that fact for the canonical (n + 1)- 
form. 

Proposition 2.2. If a is a section of nxz and (f> = nyz &, then 

cr*e = (f>*a, (2B.13) 

where <f>*a means the pull-back by 4> to X of a regarded as an (n + l)-form on 
Y along 4>. 

Proof. Evaluate the left-hand side of equation i|2B.13(l on tangent vectors 
Vx, ■ ■ ■ , i>n+i to X at x: 

(a*Q)(x)(vx, ■ ■ • , v n+ x) = (<r(x)) {T<r -vi,...,Ta- V n+ x) 

= a(x) (Tttyz - Ta-vx,..., Ttt Y z • Ta ■ v n+1 ) 

= a(x) (T<fi -vi,...,T<p - v n+ x) 

= (0*cr) (x) Oi, . . . ,v n+ x) . ■ 
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Examples 

a Particle Mechanics. Let X = K and Y = K x Q. Then Z = T*Y = 
T*R x T*Q has coordinates (t,p, q 1 , . . . , q N , p\, . . . ,pn), 

e= PA dq A +pdt (2B.14) 

and 

SI = dq A A dp A + dtA dp. (2B.15) 

In this case the multisymplectic approach reduces to the extended state space 
formulation of classical mechanics. 

b Electromagnetism. For the Maxwell theory, coordinates on Z are 

Later we shall see that can be identified, via the Legendre transform, with 
the electromagnetic field density, so we use this notation in anticipation. From 
(|2B.11|I and (|2B.12jl . we get 

6 =$ v > 1 dA v Ad 3 Xtl +pd 4 x (2B.16) 

and 

ft = dA v A riff"" A d 3 Xfl -dp A d 4 x. (2B.17) 



When the spacetime metric g is treated parametrically, Z is replaced by Z = 
Z x x S^iX) with coordinates A v ,p, ffcrp). There are no multimomenta 
conjugate to the g ap . The expressions for the canonical forms remain the same. 

c A Topological Field Theory. In the case of the Chern-Simons theory, 
coordinates on Z are A v ,p,p vp -). The canonical forms are 

9 =p vtl dA v Ad 2 Xfl +pd 3 x (2B.18) 

and 

fl = dA v A dp"' 1 A d 2 x p - dp A d 3 x. (2B.19) 
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d Bosonic Strings. Corresponding to Y = (X x M) Xx S 2 ' (X), the co- 
ordinates on Z are (x^, <j) A , h ap ,p,pA p , q app ), and the canonical forms are given 

by 

9 = (p A M d<t> A + q app dh ap ) A d 1 ^ +pd 2 x (2B.20) 

and 

CI = {d(j) A A dp A p + dh ap A dq app ) A d x x p - dp A d 2 x. (2B.21) 



We conclude this section with a few remarks on Poisson brackets. Poisson 
brackets are not essential for the main points in this paper, but we believe that 
they are important for linking up the results of this paper with those of Marsden, 
Montgomery, Morrison, and Thompson [1986] and for the future development 
of covariant reduction (see Marsden [1988]). For more thorough discussions of 
Poisson brackets in field theory, we refer the reader to the recent papers by 
Kanatchikov [1997], Lawson [2000], Castrillon and Marsden [2002], and Forger, 
Paufler, and Romer [2002a, b]. 

Recall that on a symplectic manifold (P, CI), the Hamiltonian vector field Xf 
of a function / e S'(P) is defined by 

ci(x f ,v) = vj df 

for any V £ X(P) , and the Poisson bracket is defined by 

{/, h} = ci(x f ,x h ). 

The multisymplectic analogue of a function is an n-form. In Goldschmidt and 
Sternberg [1973] and Kijowski and Szczyrba [1975] a Poisson bracket is defined 
on certain types of n-forms. A special case relevant for later considerations is 
as follows. Let V be a vector field on Y which is 7rxy-projectable; that is, there 
is a vector field U on X such that 

Tttxy oV = Uo ttxy ■ 

Consider an n-form on Z of the type 

f(z)=w YZ (VJ z) (2B.22) 

for some Tr^y-projectable vector field V; we call such an / a momentum 
observable. (As we shall see in §4C, special covariant momentum maps can 
be regarded as momentum observables.) The Hamiltonian vector field Xf 
of such a momentum observable / is defined by 

df = Xf J CI 

where fl is the canonical (n + 2)-form on Z. Since CI is nondegenerate, this 
uniquely defines Xf. The Poisson bracket of two such n-forms is the n-form 
defined by 

{f,h}=X h j(X f jCl). (2B.23) 
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For momentum observables / and h, {/, h} is, up to the addition of exact 
terms, another momentum observable; these constitute an important class since, 
as we shall see, it includes superhamiltonian and supermomentum functions, 
for instance. Also, the development in Chapter 5 of Part II shows that when 
(|2B.23j) is integrated over a hypersurface, it produces a canonical cotangent 
bundle bracket. 

3 Lagrangian Dynamics 

This chapter summarizes some of the basic facts concerning the Euler-Lagrange 
equations, the Legendre transformation and the Cartan form that will be needed 
later. There are many formulations available in the literature, including those 
of Caratheodory [1935], Weyl [1935], Lepage [1936,1941,1942], Cartan [1922], 
Dedecker [1957,1977], Trautman [1967], Hermann [1968], Sniatycki [1970a, 
1984], Krupka [1971], Ouzilou [1972], Goldschmidt and Sternberg [1973], Garcia 
[1974], Szczyrba [1976a], Sternberg [1977], Kupershmidt [1980], Aldaya and de 
Azcarraga [1980a], Griffiths [1983], and Anderson [1992]. For second order the- 
ories, see Krupka and Stepankova [1983] and, for the n th order case, Horak 
and Kolaf [1983], Munoz [1985], and Gotay [1991a, c]. Exactly which route one 
takes to end up with the basic results is, to some extent, a matter of personal 
preference. For example, one may introduce the Cartan form axiomatically 
and then define the Legendre transformation in terms of it; see for instance 
Sniatycki [1970a, 1984] for this point of view. We have chosen to introduce the 
Legendre transformation directly and then use it to define the Cartan form to 
most closely parallel the standard treatment in classical mechanics (Abraham 
and Marsden [1978], Section 3.6). We shall continue to confine ourselves to first 
order theories. 

As before, let ttxy '■ Y — > X be a bundle over X whose sections will be the 
fields of the theory. In Chapter 2 we studied the spaces J X Y and Z = J 1 !"* 
which are the field-theoretic analogues of the tangent and cotangent bundles in 
particle mechanics. Here we are concerned with the relationship between the 
two and with Lagrangian dynamics on the former. 

3A The Covariant Legendre Transformation 



the Lagrangian density, be a given smooth bundle map over X. In coordi- 
nates, we write 



We remark that since X is assumed oriented, we may take £(7) to be an (n+1)- 
form on X. If X were not orientable, then we should more properly take £(7) 
to be a density of weight one on X. 



Let 



£ : J X Y -» A n+1 A, 




(3A.1) 
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Our first goal is to construct the covariant Legendre transformation for L. 
This is a fiber-preserving map 

EC : J l Y -> ^Y* = Z 



over Y which has the coordinate expressions 



Pa" 



dL 

dv A 



-, p = L- 



^ V A 
dv\ V " 



(3A.2) 



for the multimomenta pa^ and the covariant Hamiltonian p. An intrinsic defi- 
nition follows. 

If 7 G JyY then F£(7) is the affine approximation to L | J^F at 7. Thus 
FCf/y) is the affine map from J^Y to A" +1 A (where y € Y^) given by 



<F£( 7 ), 7') = £-(7) + ^ (7 + e(Y ~ 7)) 



(3A.3) 



e=0 



where 7' S ^J^- If one wishes, the second term in (|3 A. 3|> can be interpreted 
in terms of a standard fiber derivative. To derive the coordinate expressions 
(|3A.2p . suppose 7 = v A ^ and 7' = 
reads 

dL 



Then the right hand side of l|3A.3|) 



8v A , 



-ry 



which is an affine function of w ^ with linear and constant pieces given by the 
first and second equations of l|3A.2|) respectively. Hence l|3A.2|) is indeed the 
coordinate description of WL. (The coordinatization of J X Y* was defined by 
j2BjJ.) 



3B The Cartan Form 

The Cartan form is the (n + l)-form G>£ on J X Y defined by 

On = ¥L*Q, 



(3B.I) 



where G is the canonical (n + l)-form on Z. We also define the (n + 2)-form 
^£ by 



where fl = — dO is the canonical (n + 2)-form on Z. 
In coordinates, H2B.II|I . (|2B.12fl and (|5X2)l yield 



9i 



dL 
dv A , 



■dy A A d n Xa + L 



dv A , V " 



and 



d n+1 a 



(3B.2) 



(3B.3) 



ti L = dy A A d 



dL 
dv A , 



AtT 



-d L- 



8L A 
dv A , V " 



A d 



(3B.4) 
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One application of the Cartan form is to reexpress the Lagrangian for holo- 
nomic fields as 

= (3B.5) 
To see this, we use lj3B.3|) to compute 

BT f BT \ 

(iV)*^ = ^CjV) d<p A a d% + V) - ^-O'V) <^J 

The first and last terms cancel and so we obtain (|3B . 5|) . 

It is possible to obtain ©£ directly via the action functional; it is constructed 
from the boundary terms that appear when the action integral is varied. The de- 
tails can be found in, e.g., Paquet [1941], Krupka [1987], and Marsden, Patrick, 
and Shkoller [1998]. 



3C The Euler— Lagrange Equations 

We now use the formalism developed so far to reexpress, in an intrinsic way, the 
Euler Lagrange equations, which in coordinates take the standard form 

w^-M-^)- (3ai) 

for a (local) section </> of Y. The left hand side of l|3C.l(l is often denoted 8L/8<f) A 
and is called the Eulei — Lagrange derivative of L. 

Theorem 3.1. The following assertions regarding a section (f> of the bundle 
ttxy '■ Y — > X are equivalent: 

(i) (f> is a stationary point of / ^(j 1 ^); 

Jx 

(ii) the Euler-Lagrange equations i|3C.l(l hold in coordinates; 

(iii) for any vector field W on J l Y , 

{j 1 (j>)*{Wjn c ) = 0. (3C.2) 



Remarks 1. The theorem still holds if the section <fi is only a local section: 
(f> : U — > Yjj where U is an open subset of X. The only change is that the 
integral in (i) is now over U. The case that will be of importance to us is where 
U is a neighborhood of a Cauchy surface. 



2. The meaning of "stationary point" is in the formal sense of the calculus of 
variations. Namely, a variation of cj) is provided by a curve <f>\ = n\o <f>, where 
r/\ is the flow of a vertical vector field V on Y which is compactly supported in 
X. One says that is a stationary point if 

d 



dX 



X 







A=0 



for all variations (f>\ of <j>. 
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3. This theorem appears more or less as we present it in, for example, Dedecker 
[1953], Trautman [1967], Sniatycki [1970a, 1984], Ouzilou [1972], Goldschmidt 
and Sternberg [1973], and Guillemin and Sternberg [1977]. 

4. If in (|3C.2(I one requires that s*(i\ySln) = for any section s of J X Y — > X, 
not necessarily holonomic — that is, not necessarily of the form j 1 ^ — then con- 
dition (iii) will be equivalent to the Euler-Lagrange equations and the assertion 
that s is holonomic under the assumption that the Lagrangian is "regular." A 
Lagrangian is called regular if the Legendre transformation is of maximal rank 
or, equivalently, if (J 1 Y, Sl&) is a multisymplectic manifold. In the formulation 
as we have given it, no assumption of regularity is needed; in fact such an as- 
sumption would be very much inappropriate, since it is not valid in any of our 
examples. See, however, Garcia and Munoz [1985,1991] and Saunders [1992]. 

5. Because the Euler-Lagrange equations are equivalent to the intrinsic con- 
ditions (i) and (iii), they too must be intrinsic. One can in fact write the 
Euler-Lagrange derivative intrinsically in a direct way. See Trautman [1967] 
and Marsden, Patrick, and Shkollcr [1998]. 

6. In the above we have restricted attention to vertical variations. But it is 
occasionally necessary to take "horizontal variations," as in continuum mechan- 
ics where they produce configurational forces which are important in dislocation 
theory. See Marsden, Patrick, and Shkoller [1998] for details, and Lew, Marsden, 
Ortiz, and West [2002] for further applications of the split between horizontal 
and vertical variations. ♦ 

In the course of the proof, we will examine three cases for the vector field 
W in (|3G.2|I . The following lemma shows that two of the cases are trivial. 

Lemma 3.2. If <p is a section ofirxY and if either W is tangent to the image 
°f j <fi 0T W is Ky ^y -vertical, then 

(j^nwj si c ) = o. 

Proof. First assume that W is tangent to the image of j 1 ^ in J X Y\ that is, 
W = Tfj 1 ^) • w for some vector field w on X. Then 

CjV)*(WJ Sl c ) = -w) J n c ) 

= w j (j^ysic, 

which vanishes since (j 1 ^)*!^ is an (n + 2)-form on the (n + l)-manifold X. 
Secondly, if W is 7Ty jiy-vertical, then W has the coordinate form 

W = (0,0,W\). 
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A calculation using l|3B.4|) shows that 

d 2 L 



WJ Ox 



-W B V 



dv A „dv B v 



(dy A A d% - v A ^d n+1 x) 



which vanishes when pulled back by the jet prolongation of a section of Y. ■ 

Proof of Theorem 13.11 The direct proof of equivalence of (i) and (ii) is a 
standard argument in the calculus of variations that we shall omit. We shall 
work towards a proof of the equivalence of (i) and (iii) and then of (iii) and (ii) . 

To show that (i) is equivalent to (iii), consider a variation <p\ — r]\ o (f> of 
4> (as in Remark 2) corresponding to a 7rxy-vertical vector field V on Y with 
compact support in X. Using l|3B.5|) . we compute 

£(jVa 



d 

dX 



A=0 



d 

dX 



x 



A=0 



d_ 

dX 



(?V)*(?'V)*e 



A" 



A=0 



(j^riwec, 



(3C.3) 



X 



where 



3 X V 



dX 



A=0 



is the jet prolongation of V to J l Y , given in coordinates by 



j 1 V= [o,v 



dV A dV A t 
1 v 

dx^ dy B 

and where £jiy is the Lie derivative along j V. (See §4A for an intrinsic defini- 
tion of jet prolongations and for more information; in particular, see Remark 1 
of that section). Thus, from (|3C.3I) and 



£ j i v Q L = -jty j J] £ +d(jVj Qc), 



we get 



d 

dX 



x 



A=0 



(iV)*(iV j n z ) 



X 



+ / d(jV)*(jV J Qc) 
Jx 

(iVrO'VjOr.) (3C.4) 
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by Stokes' theorem and the fact that V, and hence j x V is compactly supported 
in X. From (|3C.4(1 it follows that (iii) implies (i). 

To show the converse, we first observe that any tangent vector field W on 
J l Y can be decomposed into a component tangent to the image of j l 4> and a 
7Tj^jiy-vertical vector field. Similarly, any 7rx,jiy-vertical vector field can be 
decomposed into a jet extension of some Tr^y-vertical vector field V on Y and 
a 7Ty jiy -vertical vector field. 

Thus, if (i) holds, (|3C.4|) and Lemma E21 show that 



for all vector fields W on J l Y with compact support in X. Since W can be 
multiplied by an arbitrary scalar function on X, an argument like that in the 
Fundamental Lemma of the Calculus of Variations shows that the integrand 
must vanish for all vector fields W with compact support in X. A partition of 
unity argument then shows that the integrand vanishes for any vector field W 
whatsoever, which proves (iii). 

To see directly that (iii) is equivalent to the Euler-Lagrange equations, one 
readily computes in coordinates that along j (f>, 




(iV)*(jVj Sic) =v 




d_ ( dL 



) 



dL 



(J'V) d 



•n+l, 



X 



(3C.5) 



dx 11 \dv A ^ 



dy A 



for all 7Txy-vertical vector fields V on Y. Thus (iii) implies (ii). On the other 
hand, (|3C.5|) combined with the above remarks on decompositions of vector 
fields and Lemma f3 . 21 shows that (ii) implies (iii). ■ 
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Examples 

a Particle Mechanics. With the Lagrangian density & = L(t,q,v)dt, ex- 
pressions <|3A.2|) become pa = dL/dv A and p — L — v A dL/dv A — —E, the Car- 
tan form becomes the usual one-form for describing time-dependent mechanics 

BT 

e L = —rdq A -Edt, (3C.6) 
OV A 

and the Euler-Lagrange equations take the standard form 



d dL 
dt dv A 




(3C.7) 



Let us specifically work out the details for a relativistic free particle of mass 
m 7^ 0. The Lagrangian density is 

L = -m||v||eJf, (3C.8) 

where v = (v , v , v ,v ) is the velocity of the particle (with respect to param- 
eter time t) and ||v|| = (~gABV A v B ) 1 ^ 2 , g being the spacetime metric. 4 The 
Cartan form (|3C.6(I is thus 

9, = ^^dq A , (3C.9) 



whence 

m 9ABV B rimr>\ 
Pa = — n — n — an( l p = 0. (3C.10) 

ll v ll 

The equations of motion l|3C.7|l are just the geodesic equations for the spacetime 
metric g: 

d ( gABV B \ 1 B c fontf^ 

where t is used as the curve parameter. 

4 Sometimes in this context one encounters the Lagrangian density 

/ 1 M ,,2 

= — m v dt. 
2 

However, unlike XL given by I3C.8I XL' is not time reparametrization-invariant. In §11C we shall 
see how to 'repair' this defect in XL'. Of course, if one does not require time reparametrization- 
invariance, then the choice XL' of Lagrangian density is certainly permissible. Misner, Thorne, 
and Wheeler [1973] give a detailed discussion of these two Lagrangian densities in §13.4. 
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b Electromagnetism. For electromagnetism on a fixed background space- 
time X with metric g, the Lagrangian density is 5 

L = -^F^F^^gd 4 x, (3C.12) 

where \J—g := yf— dctTg^ and, viewed as a function on J 1 (A 1 X), the field 
tensor is defined by 

F^v = v Ufl - v^ v . (3C.13) 
The Legendre transformation gives the relations 

r v =F^yf—g and p = ±F^F»» V=9- (3C.14) 
The Cartan form 0£ is 

& c = yf—gF^dA v A d 3 x^ + ^F^F^y/=g~d 4 x (3C.15) 
and the 5-form il^ = — dO^ is 

Oc = ($"V P - <W) A dv A d 3 x„ 

- y/^gF^dDvp A d 4 a;. (3C.16) 

For a section i of 7 = A X A, H3C.13jl gives (j 1 ^)*^ = dA and the Euler- 
Lagrange equations reduce to 

(A v >» - A* v ). v = 0. (3C.17) 

Here the semicolon denotes a covariant derivative with respect to the background 
metric, so (|3C.17|I is a covariant divergence. 

When g is treated parametrically, none of the formulas above change. In 
particular, since g is not variational, it has no Euler-Lagrange equations. Even 
so, varying the Lagrangian density with respect to g produces the stress-energy- 
momentum tensor of the electromagnetic field, as shown in Example b of §4D 
and Interlude II. 



c A Topological Field Theory. The Chern-Simons Lagrangian density is 

£ = ^"F^K d 3 x, (3C.18) 

where the curvature F of A is defined by l|3C.13|) . Intrinsically, £ = FA A 
Applying the Legendre transformation, we get 

p u» = e »»a Au and p = o (3C.19) 

5 For convenience, in this example we use I3C.12I . with the coefficient —1/4, instead of 
the Misner, Thorne, and Wheeler [1973] Lagrangian, with coefficient — 1/167T. One may pass 
from one convention to the other by scaling the electromagnetic fields. 
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The Cartan form is thus 

Be = e» va A a dA v A d 2 Xfl , (3C.20) 

whence 

Ml = -e^ a dA a A dA v A d 2 x p . (3C.21) 

The Euler-Lagrange equations are e^^F^ = 0, which are equivalent to the 
connection A being fiat: 

{.fAYF = dA = 0. (3C.22) 

That these equations are only of the first order indicates that the Chern-Simons 
theory is highly degenerate. 

d Bosonic Strings. In the Polyakov model, with <f> and h as independent 

variables, the action is taken as the negative of the "energy" functional. Thus 
as a function on J l Y , the Lagrangian density is 

^ = -\VW\h ap 9A B v A ry v B p d 2 x. (3C.23) 
The Legendre transformation is 

VA tl = ~V\h\h ,lu 9ABV B u (3C.24) 

q apt " = (3C.25) 

P = \VW\h^9ABV A ^v B „ (3C.26) 
so that the Cartan form is 

e^ = VW\ [~h^9ABV B v dcjy A A d x x^ + h^g AB v A ll v B , d 2 x^j (3C.27) 

and 



- d i^VW\h^9ABV A p v B u J A d 2 x. (3C.28) 

The Euler-Lagrange equations reduce to 

{h^gAB(^)<p B m)^ = (3C.29) 

(hi^gAB^)^,,,^,^} h *0 = 9CD{m C A D ,P- (3C30) 

Equatio n H3C.29|> says that <fi satisfies the harmonic map equation. Equation 
(|3C.30(I does two things: 
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(i) it says h is conformally related to the induced metric cf>*g on X: h 2 h a p — 
(4>*g) a /3, and 

(ii) it determines the conformal factor: A 2 = g ab^)^ .p.§ B ' ,v ■ 

At first glance, it might appear that (ii) is significant while (i) is insignificant, 
since in two dimensions "any two metrics are conformally related." However, 
the content of i|3C.30() is, in fact, just the reverse. The actual theorem states 
that in two dimensions, given two metrics, there is a diffcomorphism which 
transforms one metric into a multiple of the other. Equation (|3C.30() says that 
in any coordinate system h is proportional to <j)*g. There is no need for a 
diffcomorphism. 

We summarize the content of <|3C.30() : There are three equations. Two say 
that the components of h and <p*g are proportional while the third determines 
the proportionality constant. The first two have content while the third is an 
identity. 



4 Covariant Momentum Maps and Noether's 
Theorem 

A key feature of relativistic field theories is that not all of the Euler-Lagrange 
equations necessarily describe the temporal evolution of fields; some of the equa- 
tions may impose constraints on the choice of initial data. Those constraints 
which are first class in the sense of Dirac [1950, 1964] reflect the gauge symme- 
try of the theory and play a central role in our development. In Part III we 
will see that these first class constraints are related to the vanishing of various 
momentum maps. 

To lay the groundwork for this, we now study symmetries in the Lagrangian 
and multisymplectic formalisms. First, we show how automorphisms of Y lift 
to J Y in a way compatible, via the Legendre transform, with their canonical 
lifts to Z . Second, we introduce the notion of a covariant momentum map for 
a group S acting on Z by covariant canonical transformations. Third, we prove 
that the lifted action of a group S on J Y has a covariant momentum map 
(with respect to the Cartan form) which is given by the pull-back to J Y Y of the 
covariant momentum map on Z by F£. Then we prove Noether's theorem, here 
formulated in terms of covariant momentum maps. 

4A Jet Prolongations 

Here we show how to naturally lift automorphisms of Y to automorphisms of 
J l Y . In effect, we shall construct the covariant analogue of the tangent map. 

Let r\y : Y — > Y be a 7rxy-bundle automorphism covering a diffcomorphism 
rj X ■ X -> X. If 7 : T X X -> T y Y is an element of J X Y ', let r}jx Y {i) ■ T nx[x) X -> 



36 



§4 Covariant Momentum Maps and Noether's Theorem 



T^iy-sY be denned by 

Vj 1 y{i) =Tt) Y ojoTrix 1 . (4A.1) 

The 7ivjiy-bundle automorphism j rjy ■= , t]j 1 y so constructed is called the 
first jet extension or prolongation of rjy to J X Y. In coordinates, if 7 = 
(x t *,y A ,v A tJ ,), then 

ruMl) = (r&W,r£(x,y), [d vV A + (d B r, A ) v B „} d^x'Y) > (4A.2) 

where 9b = d/dy B . Occasionally we are interested in the case when rjy is verti- 
cal; that is, r\x is the identity. Equations i|4A.l|) and l|4A.2(l simplify accordingly 
in this case. 

If V is a vector field on Y whose flow is r)\, so that 



V o rjx = 




then its first jet extension or prolongation j V := Vjiy is the vector field 
on J 1 !" whose flow is j ny 

j'VojV = ^iV- (4A.3) 
In coordinates, by differentiating (|4A.2|) . we obtain 

? V - fV V A ^ + ^v B - v A ^] UA 4) 

It is important to note that if <f> is a section of 7Txy and r\y is a 7rxy-bundle 
automorphism covering r]x, then ify ■ <f> '■= T]y <f> ifx 1S another section and 

j 1 (VY ■ <f>) = j^Y ° j 1 ^ ° Vx 1 ■ (4A.5) 
See Garcia [1974] and Aldaya and de Azcarraga [1980a] for more information. 

Remarks 1. If <fi\ = r)\ o <fi is a variation of (as in Remark 2 of §3C), then 
rix is the identity and 14A.5(1 becomes 

In this case, the generating vector field of the flow rj\ is vertical and l|4A.4|) 
reduces to 

, / 4 dV A dV A p> \ 

j V= [°> v ^ + w v & 

2. In the above mentioned references, an additional term v B \ l v A \dV x / dy B is 
present in the third component of i|4A.4l) ; this is absent here since we consider 
only transformations of Y which are bundle automorphisms so V x is indepen- 
dent of y B . 
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3. A word of caution: One can also view V as a section of the bundle TY — > Y 
and take its first jet in the sense of equation (|2A.4() . Then one gets the section 
of J 1 (TY) — > Y given in coordinates by 

/ u a, ( u a r, u jrA dv» aw ov A dv A \ 

This section is not to be confused with j x V as defined by l|4A.3|) and (|4A.4(1 : 
they are two different objects. 

4. From the definitions one finds that j 1 ([F, W}) = [fV, j x W] . ♦ 



4B Covariant Canonical Transformations 

A covariant canonical transformation is a 7rxz-bundle map r\z ■ Z — > Z 
covering a diffeomorphism r\x '■ X — > X such that 

?f z Q = Q. (4B.1) 

We say r\z is a special covariant canonical transformation if 

r)* z e = 6. (4B.2) 

If t]y '■ Y — > Y is a 7r^y-bundle automorphism (also covering a diffeomor- 
phism r/x X X), its canonical lift rjz '■ Z — ► Z is defined by 

»7z(*) = (^y 1 )^; (4B.3) 

that is, 

?7^(z) . . .,v n +i) = z (Trjy 1 -Vi,.. ■ ,Tr)y X ■ v n +i) , 

where z £ Z y and Vi, . . . , € T VY ( y -)Y. Since is a 7rxy-bundlc map, T?7y 
maps vertical vectors to vertical vectors, so r\z does indeed map Z to Z. In 
coordinates, if we write z = pd n+1 x + pA ti dy A A d n x t _ l , and notice that 

and 

(^ 1 )*dj / A = ^fe 1 ) A ^ + a iJ (7 ?i ; 1 ) A dt/ s ! 

we get 

»7z(z) = (VyT 2 = (p + dviVvYp^d^Vx) J-'d n+1 x 

+ (dAiVy 1 ) 3 PB V d u r]^j J- 1 dy A A d%, (4B.4) 

where J is the Jacobian determinant of r\x and where the quantities in 1|4B.4|) 
are evaluated at the appropriate points. 
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Proposition 4.1. Lifts are special covariant canonical transformations. 



This is readily proved using a method similar to the cotangent bundle case 
(Abraham and Marsden [1978], Theorem 3.2.12). 

Here is a second way to understand the lift of a 7rxy-bundle automorphism 
j-jY ■ Y — ► Y covering a diffcomorphism i] X : X — > X, which proceeds via ^Y* 
rather than directly to Z. We define the lift of r\y to J 1 !"* to be the affine dual 
of the prolongation rjjiy; that is, if z : JyY — » A" +1 A is an element of JyY*, 
let 



n+l 



be the affine map defined by 



(4B.5) 



In coordinates, if z = (p,p^) acts on 7 = y A , v A ^) as in (|2B.1|I . then from 



= {n x 1 )* [(p+PA^x [dv(vY 1 ) A + v B »d B (r,y 1 ) 

(p + OuiVvYPA^Vx) + (d A (Tfy 1 ) B PB U duT&)v 



(4B.6) 



J~ L d 



so that the terms in (|4B.6|I correspond to the components in 1|4B.4() via (|2B.6(1 . 

Consequently, we have given a coordinate proof of 

Proposition 4.2. The canonical isomorphism $ : Z — > J l Y* is equivariant 
with respect to the lifts r\z and r\j\ Y *; that is, 



<£> o r]z = TJjiy* & 



(4B.7) 



Here is an invariant proof of Proposition 4.2 using (|2B.4|) . dj4A.ll> . i|4B.3|) 
and (|4B . 5|> above. For z £ Z and 7 G J l Y, we have 

(t, Jly *($(z)),7) - (*7i 1 )*<*(*)»vV(7)) 

= (r 7x 1 )*($(z),r7 ?y 1 o 7 oTr y x) 



and so (|4B . 7fl holds. Thus, the identification of Z and J 1 !^* respects the action 
of a lifted covariant canonical transformation. 
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Let V be a vector field on Y whose flow rjx maps fibers of ttxy to fibers. The 
canonical lift of V to Z is the vector field Vz that generates the canonical lift 
of this flow to Z. By differentiating i|4B.4[) we obtain 

V z = (V, V A , -pV, v - p B "V E \u,PA v V > - pb"V b . a - pa»V\ v ) (4B.8) 

in coordinates, where V = (V^, V A ). By either direct calculation or by Propo- 
sition ^3 we get £vz® = 0- 

4C Covariant Momentum Maps 

We now introduce the covariant analogue of momentum maps in symplectic and 
Poisson geometry (sec for example, Abraham and Marsden [4978], Marsden, 
Weinstein, Ratiu, Schmid, and Spencer [4983], Guillemin and Sternberg [1984], 
Marsden [4992], and Marsden and Ratiu [1999]. 

Let S denote a Lie group (perhaps infinite-dimensional) with Lie algebra q 
that acts on X by diffeomorphisms and acts on Z (or Y) as ttxz- (or ttxy~) 
bundle automorphisms. For r\ € S, let r)XiWY, and rjz denote the corresponding 
transformations of X,Y, and Z, and for ( £ 5, let £,x,£,y, and £z denote the 
corresponding infinitesimal generators. If S acts on Z by covariant canonical 
transformations, then the Lie derivative of Q along £z is zero: 

£ (z n = (4C.4) 

and if 9 acts by special covariant canonical transformations, then 

£ (z Q = 0. (4C.2) 

If 9 acts on Z by covariant canonical transformations, then a covariant 
momentum map (or a multimomentum map) for this action is a map 

J : Z g* ®k n Z = L( Q ,A n Z) 

covering the identity on Z such that 

dJ(0=ifcJl, (4C.3) 

where J(£) is the n-form on Z whose value at z € Z is (J(z),£). A covariant 
momentum map is said to be Ad* -equivariant if 

j(Ad-^) =V*z[m]- (4C4) 

If 9 acts by special covariant canonical transformations, there is an explicit 
formula for a special covariant momentum map. We claim that 

j(0 = i Cz e (4C.5) 

is a covariant momentum map. Indeed, 



dj(o - di Cz e = (£ 5z - i Cz d)e = i iz n. 
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Proposition 4.3. Special covariant momentum maps are Ad* -equivariant. 

The proof is analogous to that for the cotangent bundle case (Abraham and 
Marsden [1978], Theorem 4.2.10). For the particular case of lifted actions, we 
have: 

Proposition 4.4. If the action of S on Z is the lift of an action of S on Y , 
then the special covariant momentum map is given by 

A0(z) =*Yzkr*- ( 4C - 6 ) 
This momentum map is Ad* -equivariant. 

This follows from the recognition that £y = Tiryz ■ £,z, the definitions of 
J(£) and 6, and Proposition 14.31 

In coordinates, if £y has components {^,£, A ), then equation (|4C.6|I reads 

J(0(z) = ( PA ^ A + p£")d% ~ PA ^»dy A A d n -\ v , (4C.7) 

where 

d^x^ = i d J a J n+1 x. 



In passing, we reformulate equivariance for lifted actions in terms of Poisson 
brackets, just as one does in the usual context of symplectic and Poisson mani- 
folds. Observe that according to l)4C.6|l . special covariant momentum maps are 
"momentum observables" in the sense of l|2B.22j) . 

Proposition 4.5. For lifted actions, 

{ .7(0, J(C)} - d(i ez i c ,6) + J([t C]). (4C.8) 

Proof. Set rj\ = exp(AC). Differentiating the equivariance relation (|4C.4|) 
with respect to A at A = gives 

■/([£, CD = £ Cz J(0 = di c ,j(0 + i c ,dj(0 = di Cz j(0 + i Cz k z n 

by PTBll . This, JEHU, and I2R.23H give PTTsll . ■ 



Examples 



Since X is assumed oriented, Diff(A) will henceforth denote the group of 
orientation-preserving diffeomorphisms of X. 
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a Particle Mechanics. For classical mechanics with time reparametrization 
freedom (see, for example, Kuchaf [1973]), we let 9 = Diff(R), the diffeomor- 
phism group of X = R. Then 9 acts on Y = R x Q by time reparametrizations 
and this action may be lifted to Z = T*R x T*Q. We identify q with the space 
y(R) of smooth functions on R and identify 5F(R)*, the space of densities on R, 
with 5F(R). In terms of the coordinates of Example a of §2B, we have 

for x G ?(R). The momentum map 

J : Z = T*R x T*Q -> ?(R)* w J(R) 

is defined by 

(J(t,p,«\... =PX(t). (4C.9) 

If we consider a classical mechanical system with a fixed time parametriza- 
tion and with a group G acting on Q, then the covariant momentum map induced 
on Z is the usual momentum map on T*Q regarded as a map on Z . 

In this example, note that covariant canonical maps include time- dependent 
canonical transformations. 



b Electro-magnetism. The appropriate invariance group for electromag- 
netism on a fixed spacetime background X is 9 = 3{X), the additive group of 
smooth functions on spacetime. Let 9 act on Y = A X X as follows: For / £ 3^(X) 
and A £ A*X, 

/ • A = A + df(x) e K\X. (4C.10) 
For x G g = 3 r {X) 1 the infinitesimal generator is given by 

Xy{a) = x - v Ja v - (4C ' n) 

From (I4C.7I) . the corresponding covariant momentum map is 

(J(x,A,p,$), X ) = 3""*X,*dV (4C.12) 



If we treat the metric g on X parametrically, we may enlarge the group 9 to 
be the semi-direct product Diff(X) © 9(X), with Diff(X) acting on 0F(X) by 
push-forward; that is, r) ■ f = f o rj^ 1 . Then 9 acts on Y — Y x x S 2 ' (X) by 

( V ,f) ■ (A;g) = ((n^TiA + dfix)); (r,' 1 )* ff) ■ (4C.13) 

The infinitesimal generator (£, x)y at the point (A; g) £ Y x is 

(^(X)(9)U^- + + (Xy(A))° (4C14) 
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where 

(£s»-\x)(9)U = (4C15) 

(Zy(A)) v = -A^ >v , (4C.16) 

and xy(A) is given by (|4C.11(I . From l|4C.7jl . the multimomentum map J : Z — > 
q* ® A 3 Z for this action is 

(j(x,A,p,$;g) t (S,x)) = 

{-r^ArCv+r^X,* +P^) d 3 x„ - r^ v dA T A d 2 x^. (4C.17) 

c A Topological Field Theory. Just as in Example b above, we take 
S = Diff(X) © &(X) acting on Y according to 

(r ) ,f)-A = (r ) - 1 r(A + df(x)). (4C.18) 

The corresponding infinitesimal generator is 

(^x)y(A) = (fr(A))„^- + (XY(A))„J^- + e{x)^ (4C.19) 

where (£y {A)) v an d Xy(A) are given by l|4C.16fl and I4C. 1 lfl . respectively. Then 
we compute 

(J(x»,A„,p,p^),(Z, x )) = 

(-p^ArC^+P^u+Pe) d 2 x^-p^CdA T A dV- (4C-20) 

d Bosonic Strings. Corresponding to Y = (X x M) Xx S 2 ' (X), we take 
the invariance group for bosonic strings to be the semi-direct product 

S = Diff(X)(S)Con^ 1 (X), 

where Con^' 1 ^) is the group of conformal transformations of S 2 ' (X) (which 
may be identified with the group ^(X, K + ) of positive real- valued functions on 
X) and rj G Diff(X) acts on A G Con^^X) by push-forward: 

rj- A = Aorj- 1 . (4C.21) 

Then (rj, A) G 9 acts on (x, 4>, h) G Y x by 

(77, A) • (0, h) = (4, or,, (r)~ 1 )*(A 2 (x)h)) . (4C.22) 

Hence, for (£, A) G g = £(AT) (§) ^(Ar), the corresponding infinitesimal generator 
(£, A)y at (0, ft,) G F is 

(M%^ + (frW)^+?p (40.23) 
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where 

(X Y (h)) ap = 2Xh ap (4C.24) 

and 

(£y WW = -(h^e, P + h m e,*)- (4C.25) 

Note that there is no component in the direction d/d<p A since <fi is a scalar field 
on X. From l|4C.7Jl . we get the covariant momentum map 

{J{x^ct> A ,K p ,p,p A ^q^),^,x)) 

= [ q °^ (2\h ap - K v e, P - Vf.) +pe]d 1 x ll 

~ (p A »Cdcj> A + q° p »t, v dh ap )e» v (4C.26) 
where we have written e M „ = d 2 x pu . 



Remark As we shall see in Part III the covariant momentum maps l|4C.12Jl . 
(|4C.20(I . and l|4C.26j) are closely related to the divergence, spatial flatness, and 
superhamiltonian and supcrmomentum constraints of electromagnetism, Chern- 
Simons theory, and string theory, respectively. ♦ 



The covariant momentum maps discussed and illustrated in this section do 
not involve the Lagrangian at all. In the next section, however, we will show that 
when a theory possesses symmetries, an important link between the Lagrangian 
and the covariant momentum map may be established. 

4D Symmetries and Noether's Theorem 

Having assembled the necessary background in §§4A-C, we are now able to state 
and prove Noether's Theorem. This fundamental result will be used in Part III 
to show that the energy-momentum map in the Lagrangian representation van- 
ishes identically on solutions of the Euler-Lagrange equations. As we shall see 
in Chapters 10 and 11, the vanishing of this energy-momentum map on a single 
Cauchy surface is often equivalent to the totality of the first class constraints of 
the theory. However, its vanishing on a whole family of hypersurfaces is equiv- 
alent to the entire Euler-Lagrange system under a suitable hypothesis on the 
gauge group; see §9C. 

Suppose that a group 9 acts on Y by bundle automorphisms. Prolong this 
action to J X Y by setting r) ■ 7 = rjjiy (7) as in §4A. We say that the Lagrangian 
density L is equivariant with respect to 9 if for all ij £ 9 and 7 £ J%Y, 



£(^y(7)) = tey£(7), 



(4D.1) 
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where (ilx 1 )* ^(7) means that the (n+l)-form £(7) at 1 e X is pushed forward 
to an (n + l)-form at r]x{x); equality in (|4D.1I) thus means equality of (n + In- 
forms at the point rj(x) . The infinitesimal version of this is the following equation 
obtained from (|3A.1|) and (|4A.4|) : 

& + W (A + wr„ { (A - - " A ^» + " B ^) + i£ " » " (4D ' 2 > 

The quantity S^L defined by the left hand side of this equation is called the 
variation of L in the direction £. 

One of the fundamental assumptions that we shall make in this work is the 
following: 



Al Covariance. The group 9 o-cts on Y by ttxy -bundle automorphisms and 
the Lagrangian density & is equivariant with respect to 9- 

This is a strong assumption but, amazingly, the gauge groups of most field 
theories satisfy it. One general class of exceptions are the topological field 
theories, whose gauge groups tend to leave but not £, invariant. We will 
discuss how to deal with one such case in Example c below. This assumption has 
a number of important consequences, some of which are stated in the following 
proposition. 

Proposition 4.6. Let £ be equivariant with respect to the lifted action of 9 on 
J X Y. Then 

(i) W£j is also equivariant; that is, rjz F£ = F£> Vj 1 y- Thus, the diagram 

jiy Z 

njiY l b z 



jiy Z 



commutes for every rj G 9 , 



(ii) the Cartan form 0£ is invariant; that is, r]j lY Qn = 0£ for all rj G 9, 
and 

(iii) the map J' Cj (C) := F£*J(£) : J 1 Y — > A n (J 1 Y) is a momentum map for 
the lifted action of 5 on J X Y relative to that is, for all £ G 0, 

Zjty J n c =dJ c (0, (4D.3) 

where £jiy = j £y is the infinitesimal generator corresponding to £. More- 
over, 

J l (0=^yJ&l- (4D.4) 

Proof. 
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(i) It is convenient to identify Z with J Y Y* and use Proposition 14.21 From 
(|4R5)l and (|aO)l . we have 

(T7 Jiy *(F/:(7)),y) - (r ?x 1 )*<FiL( 7 ), ?? 7 1 V(7')) 



t(7}+|tCr+^(V)-7]) 



e=0 



On the other hand, using we have 

(F£,(7, J i y ( 7 )),y) = £(^(7)) + ^(^y(7))+e[7'-^y(7)]) 
These are equal by the covariance condition Al. 



e=0 



(ii) The proof is analogous to that for Corollary 4.2.14 of Abraham and Mars- 
den [1978]. 



(iii) The infinitesimal version of (i) states 

^oF£ = TF£ o£ji Y . 



(4D.5) 



Pulling l|4C.3jl back from Z to J X Y along the Legendre transformation 
and using ll4D~5)l yields pfl3) . Similarly, pulling (|4C~5)) back by F£ 
yields l|4lT4)l . ■ 



Because of this theorem, we call J 10 the covariant momentum map in 
the Lagrangian representation. From (|4D.4|) and l|3B.3(l or, equivalently, 
from (|4C.7() and (|3A.2|I . we have the explicit formula 



(. 



dL 



V dv A „ 



L — 



dL 

dv A „ 



dL 
dv A , 



i u dy A A d' 



(4D.6) 



If <fi is a solution of the Euler-Lagrange equations, then from Theorem 13. II 

C?V)*(wj n c ) = 

for any vector field W on J 1 !^. In particular, setting = £jiy and applying 
(j 1 ^)* to l|4D.3|) . we obtain the following basic Noether conservation law: 

Theorem 4.7 (Divergence Form of Noether's Theorem). If the covari- 
ance assumption Al is satisfied, then for each £ € q, 

d[(iV)V*(0] =0 (4D.7) 

for any section <fi of ttxy satisfying the Euler-Lagrange equations. 
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The quantity (j'V)*«' (£) is called the Noether current, and Theorem 
4.7 is often referred to as the first Noether theorem. 6 We now proceed 
to compute coordinate expressions for the Noether current and its divergence 
without assuming that the field equations are satisfied. In the process, we will 
derive a criteria for when a theory has a converse to Noether's theorem; that is, 
when the Noether conservation law implies the Euler-Lagrange equations. 



From !|4D.6|) we get 



(97 



which, upon using the identity 



dx x A d™" 1 ^ = d% - ^ d"^, 



simplifies to 

(j'V)v £ (f) 



_5L_ 



d™^ 



(4D.8 



Defining the iie derivative of </> along £ by 



(4D.9) 



I4D . 8f) may be rewritten as 



(iV)(A^) A + i(jV)e 



(4D.10) 



where 4> is any section of ttxy, not necessarily satisfying the Euler-Lagrange 
equations. 

Next, taking the exterior derivative of the Noether current l|4D.8(l and using 
the identity 

d(F"d%) = d p Vdx p A d% = d^Vd^x 

(which justifies the name "divergence" for the resulting quantity), we obtain 
after some rearrangement and cancellation, 



6 The "second Noether theorem," which we call the "Vanishing Theorem," will be discussed 
in Chapter 9. 
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dL 

dv A „ 



dy< 



dx^ \ 



dL 



dv A 



(iV) 



d n+1 a 



(iV) 



(4D.11) 



Here SL/S(f> A is the Euler-Lagrange derivative of L defined in (|3C.1|) . which 
vanishes when the field equations are satisfied, and S^L is the variation of L 
defined in 14D.2(I . which vanishes when the Lagrangian is equivariant. Thus 
equation l|4D.ll(l again makes it obvious that the Noether conservation laws 
hold when the Lagrangian is equivariant and the Euler-Lagrange equations are 
satisfied. 

To obtain a converse, we say that the S-action is vertically transitive if 
for each x € X and each y G Y x and each (local) section through y, 

{(£^)(x)\^eg} = V v Y. (4D.12) 

Looking forward to Part II, §7A, we note that this is the infinitesimal version 
of the statement that at each point x E X, there exists a group element that 
transforms any given value of the fields at x into any other specified value. This 
is true for metrics and connections under the diffeomorphism group and for 
connections under bundle automorphisms, but is rarely true for other fields. 
Equation l|4D.ll|) now implies 

Theorem 4.8. Assume that the action of S on Y is vertically transitive and 
that the Lagrangian density is S- equivariant. Then a local section (p of Y is a 
solution to the Euler-Lagrange equations (3C.1) if and only if it satisfies the 
Noether conservation law (I4D.7|) for all £ £ g. 

To check whether the S-action for a given field theory is vertically transitive, 
we may use the equivalent formulations given in: 
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Proposition 4.9. Assume S acts on Y and let 

o(y) = {frfo) U e 0} 

denote the span of the infinitesimal generators of the action at y G 1"; i/iai is, 
ifte tangent space to the orbit through y. Vertical transitivity of the action of S 
on y is equivalent to either of the following conditions: 

(i) -For eacft y (z Y and each section <f> with <f>(x) — y, the image of <f> is 
transverse to g(y); that is, 

Q {y) + iuiT x ^ = TyY. (4D.13) 

(ii) For each y G Y and every choice of bundle coordinates {x^,y A ), we have 

{Z A (y)d A | £ G fl} = ^Y, (4D.14) 

Note that an action can be vertically transitive even if none of the £y are 
vertical; note also from 14D.9(I that — (£f(p)(x) is exactly the vertical component 
Of £y (#*))• 



Examples 

In each of the following examples, we compute the Noether current and its 
divergence without assuming the fields satisfy the Eulcr -Lagrange equations, 
and we then check for vertical transitivity. 



a Particle Mechanics. From the infinitesimal covariance condition i|4D.2(l . 
we see that a particle Lagrangian is Diff (R)-equi variant (that is to say, time 
reparametrization-invariant) if and only if it is time-independent and the energy 
E = v A (dL/dv A ) — L vanishes. Suppose also that there is a group G acting on Q 
whose prolongation to TQ leaves the Lagrangian invariant. Take S = Diff (M) x G 
and write elements of g as pairs (XiO- Then 14D.8(I reads 



C?V)V £ ( X ,0 = 



dL 

dv A 



(£ A - v A X ) + L X 



(J L (0-E X ) 



(4D.15) 



where J L (£) = {dL/dv A )^ A is the usual momentum map for G acting on Q 
in Lagrangian mechanics. Then 14D.7(1 asserts that on solutions of the Euler- 
Lagrange equations, 

j/ L (0 = 0- (4D.16) 
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This group S is vertically transitive iff G acts transitively on Q. 

In the special case of the relativistic free particle, the Lagrangian density 
(|3C.8(I is Diff (M)-equivariant. Then the corresponding momentum map van- 
ishes identically consistent with the fact that p = 0, cf. I|3C.10J| and l|4C.9|l . 
On the other hand, suppose that (Q,g) is Minkowski space and that G is the 
Poincare group. Then £ is G-invariant and (|4D.16|) expresses the fact that 
relativistic energy-momentum and angular momentum are constant along dy- 
namical trajectories. This is equivalent to the statement that the dynamical 
trajectories are geodesies of g. 



b Electromagnetism. Let 9 = ^(X) act on Y = A}X as in Example b of 
§4C. The Maxwell Lagrangian density (|3C.12I) is invariant under the prolonga- 
tion of J(X) to J 1 (A 1 X). In this case (I4LV7I) reads 

d[(/A)V £ ( X )] =&[( ] 1 Ay(F^J—gx,vd\ tl )] 

= d[(A^-A^) y/=gx^d 3 x M ] 

= [(A" v - A"*) J—g ] 4i X ,„ d 4 x = (4D.17) 

where we have used dx x A d 3 x^ = 5 X d A x and noted that the terms x,\iv have 
dropped out by symmetry. Since x is arbitrary, l|4D.17|l is equivalent to 

( AM ,v _ A"^).^ = (4D.18) 

which are Maxwell's equations. From (|4C.11|) it is clear that the group 3{X) is 
vertically transitive. 

Observe that when the spacetime metric g is fixed, the Maxwell Lagrangian 
density (|3(J.12|) is not Diff(X)-equivariant: £((77, 0) jiy (7)) = £(7), whereas 

( V ^)*L( 7 )=Si( 7 )J-\ 

J being the Jacobian determinant of rjx- We can remedy this by allowing g 
to vary parametrically, in which case the y/—g factor in £ will give rise to 
a factor of J which will cancel the offending factor J -1 in the above. Com- 
bined with our previous results, we see that parametrized electromagnetism is 
(Diff(X) (S) S'(X))-equivariant: 8^ X )L — 0. But there is a price to be paid for 
this extended equivariance: Noether's theorem no longer holds for J L = F£* J. 
Indeed, from l|4D.ll|l we see that even if the section A of A X X satisfies Maxwell's 
equations, 

((fA) x 9 yj L ^x)\ = -^{£sgUd*x 



crp 



= ^ p (£ i g) rTp d i x 
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need not vanish, where 

1 ap = - {^g ap F aP F a P + g pP F a °F f3 ^j y/=g (4D.19) 

is the stress-energy-momentum tensor of the electromagnetic field (cf. Gotay 
and Marsden [1992] and Interlude II). The reason, of course, is that g is not 
variational, whence 5L/8g ap = dL/dg ap ^ necessarily. 



c A Topological Field Theory. The behavior of the Chern-Simons field 
is exactly opposite to that of parametrized electromagnetism: the Lagrangian 
density H3C.18(I is equivariant under the action of Diff(X), but not under that 
of ^(A). Indeed, we compute 

%x) L = \^ VC F^X,.- (4D.20) 

Consequently the Legendre transformation is not equivariant, and the Cartan 
form is not invariant. 

Nonetheless, we can verify by direct computation that the Noether current 

(V CT (-A T r,, - A v ,rC + X,») + \e" Ta F„ T e^j A a d 2 x p (4D.21) 

is conserved; we obtain 

d [(jl4)V^, X )] = e^'F^ (A a A T + A T C. a + \x,^j 

which is zero when the Chern-Simons equations F pv = hold. Alternately, we 
could appeal to l|4D.ll|) and observe from (|4D.20(1 that 5^^L vanishes on shell. 

As in the case of electromagnetism, the action of Diff (A) ©^(A) on A 1 (A) 
is vertically transitive. Since in particular the value of X,<?( x ) is arbitrary, the 
conservation of the Noether current implies that F^ v {x) = A^^lx) — A^ v (x) = 
at each x S A. 



d Bosonic Strings. The harmonic map Lagrangian density defined by 
equation il3C.23(l is equivariant under the action of Diff (A) (§)Con 2 ' (A*) on 
Y = (A x M) x x S* 2 M (A). Using PTT^-MC ).25l) to compute the Lie deriva- 
tives £^4>, £ s h, and £\h via (|4D.9|) . 14D.11|) gives 

d[/(M)*J £ (f,A)] = 

(ST C T \ 

J^A^A" + {K P A V - 2A^ P + h a „C, P + h pv e,a)j d 2 x. (4D.22) 



;4D Symmetries and Noether's Theorem 51 



At a given point the quantities A, £ v , and £ v tP can be specified arbitrar- 

ily. The arbitrariness of £" tP implies that SL/Sh^p — 0, and so we obtain the 
"conformal" Euler-Lagrange equation (|3(J.30[) as a conserved Noether current. 
The arbitrariness of then gives 




which is not equivalent to the harmonic map equation l|3C.29(l . This is consistent 
with the fact that S is not vertically transitive; indeed (|4D.12|) is not satisfied 
since, by (|4C.23|) . (£, X)y has no component in the d/d<p A direction. 
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Interlude I — On Classical Field Theory 

Classical field theories come in many forms and varieties, and it is useful to 
attempt a broad classification of such theories, based upon certain features which 
play an important role in this work. The first of these features devolves upon 
the notion of gauge group which is, roughly speaking, the largest localizable 
group of transformations of the theory under which the Lagrangian density is 
equivariant (so that the group maps solutions of the Eulcr-Lagrange equations 
to solutions). The concept of gauge group is explored more fully in Part III, 
Chapter 8; for this discussion, we use the term loosely. It will be helpful in what 
follows to keep in mind the examples in Chapter 4, since the groups studied there 
are in fact the gauge groups of their respective theories. 

Consider a classical field theory with gauge group 9- Suppose that 9 C 
Aut(Y), the automorphism group of the covariant configuration bundle Y. We 
may distinguish two basic types of field theories based upon the relationship 
between the gauge group 9 and the (spacetime) diffeomorphism group Diff(X). 
The first consists of those which are parametrized in the sense that the natural 
homomorphism Aut(Y) — ► Diff(A) given by r\y i— * Tjx maps 9 onto Diff(A) (or 
at least onto a "sufficiently large" subgroup thereof, such as the compactly sup- 
ported diffeomorphisms). This terminology reflects the fact that such a theory is 
invariant under (essentially) arbitrary relabeling of the points of the parameter 
"spacetime" X. (In relativity theory, cf. Anderson [1967], one would say that X 
is a relative object in the theory.) The parametrized theory par excellence is of 
course general relativity, in which case 9 equals the spacetime diffeomorphism 
group. Another example is provided by the relativistic free particle, whose gauge 
group consists of (oriented) time reparametrizations. The Polyakov and Nambu 
strings are also parametrized theories, as are most topological field theories (see 
Horowitz [1989]). 

At the other extreme, there are the background theories in which 9 C 
Autid(F), the automorphisms of Y which cover the identity on X (or at least 
in which the image of 9 in Diff (X) is "sufficiently small" — the Poincare group, 
for instance). In this case X is called an absolute element of the theory, much 
like time in Newtonian mechanics, or Minkowski spacetime in special relativity. 
The prototypical background theory is electromagnctism treated on a fixed, 
non-interacting, background spacetime; here 9 consists of shifts of the vector 
potential by exact 1-forms and so is completely "internal." Yang-Mills-Higgs 
theory is similar. Traditionally, Newtonian mechanical systems are treated as 
background theories; usually, such systems do not even have gauge groups. 

In many ways, parametrized theories are preferred over background theories. 
According to the principle of general covariance, one should model physical sys- 
tems so that their descriptions contain no absolute objects. From a more prosaic 
standpoint, parametrized theories are easier to handle since, for example, they 
readily admit compatible slicings (necessary for the space + time decomposition 
of the theory, cf. Part II, §6A and §6D). Only in a parametrized theory can 
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one properly correlate the dynamics and the initial value constraints of a theory 
with its gauge freedom, cf. Part II §7F, and Part III Chapter 13; in fact, the sys- 
tem must effectively be parametrized to do Hamiltonian dynamics "correctly," 
as our development will show. For more information on parametrized theories 
compared to background theories, see Anderson [1967] and Kuchaf [1973]. 

Fortunately, most background theories can readily be converted into param- 
etrized theories. The conversion usually involves the promotion of previously 
fixed objects, notably a metric on A, into the ranks of the field variables. In 
this way one can often make the Lagrangian density Diff ( A )-equi variant, and 
thereby enlarge the gauge group to "include" the diffeomorphism group of X. 
Thus, for example, when one encounters a background theory, such as Yang- 
Mills-Higgs on Minkowski spacetime, one can parametrize it by simply "turning 
the metric on." This is precisely how we have parametrized clcctromagnctism 
in Example b. As long as no new terms are added to the Lagrangian and 
the previously fixed fields are not varied to give Euler-Lagrange equations, the 
converted theory, while now parametrized, remains physically equivalent to the 
original background theory. Kuchaf [1973] finds an alternate way to parametrize 
a background theory, by introducing certain slicing parameters, treating them 
as fields, and letting them evolve dynamically. Further details of this approach 
can be found in Kuchaf [1976]. 

Among either class of field theories, we may further distinguish metric and 
non-metric theories. Metric theories are those which carry a (Lorentzian, Rie- 
mannian or general signature) metric on the parameter space X. Most familiar 
classical field theories are metric, with topological field theories and theories 
whose action functionals measure length, area, volume, etc., being important 
exceptions. Most mechanical systems (relativistic or nonrelativistic) , on the 
other hand, are non-metric. (In this regard, we emphasize that our terminology 
refers to the metric structure of X, which in mechanics is usually taken to be 
the time line; the kinetic energy metrics familiar in mechanics live instead on 
the fibers of Y -> X. 7 ) 

In metric theories the metric may be an absolute object, in that it does not 
"evolve" or dynamically interact with any of the other elements of the theory 
(as in electromagnetism on a fixed spacetime background), or it may be treated 
as one of the field variables. Again, the principle of general covariance suggests 
that the metric should be treated as a relative object. When this is the case, the 
metric may be variational in the sense that it is varied to give field equations 
(as in theories of gravity), or it may appear essentially as a parameter in that it 
may take different values, but is not varied to give Euler-Lagrange equations. 
Electromagnetism on a variable but non-dynamic spacetime is an example of a 
theory in which the metric is treated parametrically in this fashion. Theories 
which contain dynamic or parametric metrics are typically parametrized whereas 
theories with fixed metrics are usually background theories. 

A word of caution: If a theory has some fields which are parametric (but 



7 Our terminology here is also at odds with the standard usage of "metric" and "non- 
metric" as applied to theories of gravity. 
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not variational) then the Noether conservation laws may not hold because the 
parametric field is not varied and so does not satisfy any field equations. We 
have already encountered this in Example b of §4D. 

As noted above, our formalism is best applied to parametrized theories. 
Likewise, it is often advantageous to work in the context of metric theories, with 
the metric being treated dynamically. This can sometimes be achieved using 
the "Polyakov trick," which transforms the Nambu string into the Polyakov 
string. We show how this goes for the relativistic free particle in Part III, 
§11C. We suspect that this procedure could be useful in treating some of the 
"nonstandard" vector field theories of Isenberg and Nester [1977]. 

We summarize our classification with a chart, indicating in which categories 
various field theories fall. 




Nonrelativistic Mechanics 

Electromagnetism 
Yang-Mills-Higgs 
Sigma Model 

Relativistic Fluids/Elasticity /Plasmas 

Relativistic Free Particle 
Nambu String 
Minimal Membrane 
Topological Field Theories 

Parametric Electromagnetism 
Parametric Yang-Mills-Higgs 
Parametric Sigma Model 
Parametric Fluids/Elasticity/Plasmas 

Gravity 

Einstein-Maxwell 
Einstein- Yang-Mills-Higgs 
Einstein-Sigma-Model 
Polyakov String 
Harmonic Membrane 
Polyakov Particle 
Einstein-Euler/Elasticity 
Einstein-Maxwell-Vlasov 



Figure 4.1: Classification of Field Theories 



In this work, we discuss example theories from each category. But there 
are many other theories, not touched upon here, to which our formalism may 
be applied. Two especially important ones are fluids and elasticity (cf. Bao, 
Marsden, and Walton [1985]). 
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Let us specifically consider the case of a fluid in the material representa- 
tion. The basic variables are the particle placement fields <f) : Y, x M. ^> M 
which have the physical interpretation that <j>({x} x R) is the world line of the 
particle with material label x G E, where S is the fluid "reference manifold" 
and M is spacetime. Thus Y is the trivial bundle (E x 1) x M. Consistent with 
standard practice in continuum mechanics, M carries the spacetime metric while 
Exl carries a reference metric h out of which things like the internal energy are 
built (cf. Marsden and Hughes [1983]). Observe that Diff(£ x R) encodes the 
material symmetries (including particle relabeling and time reparametrization 
symmetries). In this formulation, the theory is already parametrized; note also 
that the metric h on S x R is treated parametrically. 

This approach to fluids is in keeping with our treatment of strings. But it 
is not particularly suited to a discussion of general relativistic fluids. In this 
context it is more convenient to use the inverse material representation. 
The fields then include the gravitational field (treated variationally or paramet- 
rically) with the fluid described by particle labeling fields ip : X — > X x R, 
where ip — 4> ■ In addition to the material symmetries given by Diff(S x R) 
we have the spacetime- based DiS(X) gauge symmetries associated with the 
gravitational field. The same "switch" to the inverse material representation 
obviously can be made for strings, and similar remarks apply to elasticity (with 
some qualifications). 
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